TensorLens: End-to-End Transformer Analysis
- TensorLens is a framework that represents a transformer as an input-dependent linear operator using a high-order attention–interaction tensor to encapsulate all computational components.
- It reformulates multi-head attention, layer normalization, feed-forward networks, and residuals into a unified linear Jacobian through precise vectorization techniques.
- Empirical tests demonstrate that TensorLens outperforms traditional methods in visualization, probing, and manipulation of transformer behaviors, supporting tasks like model distillation and ablation.
TensorLens is a theoretical and practical framework for end-to-end transformer analysis, centered on the construction of a high-order attention–interaction tensor that encodes a full transformer as a single, input-dependent linear operator. This tensorial representation captures all computational components of a transformer block—including multi-head attention, feed-forward networks (FFN), layer normalizations, and residual connections—in a unified, expressive formalism. TensorLens provides both the mathematical apparatus and empirical tools for interpretability, visualization, manipulation, and probing of transformer architectures, overcoming limitations of previous attention-aggregation methodologies (Atad et al., 25 Jan 2026).
1. Mathematical Formulation of the High-Order Attention–Interaction Tensor
TensorLens formulates a vanilla $N$-layer transformer, at fixed input $X \in \mathbb{R}^{L \times D}$, as an input-conditioned linear operator $\mathcal{T}(X)$, where $L$ is the sequence length and $D$ the hidden dimension, so that

$$\mathrm{vec}(Y) = \mathcal{T}(X)\,\mathrm{vec}(X),$$

with $\mathcal{T}(X) \in \mathbb{R}^{LD \times LD}$ a matrix unfolding of the underlying high-order tensor. The model-wide tensor is the ordered product of per-layer block-tensors $\mathcal{T}_\ell(X)$:

$$\mathcal{T}(X) = \mathcal{T}_N(X)\,\mathcal{T}_{N-1}(X)\cdots\mathcal{T}_1(X),$$

and thereby the entire forward pass is expressed as a single matrix-vector product.
Each block-tensor $\mathcal{T}_\ell(X)$ encapsulates attention, both residual and non-residual pathways, layer normalizations, and FFN operations as a single linear transformation:

$$\mathcal{T}_\ell(X) = \big(I + F_\ell\, N^{(2)}_\ell\big)\big(I + A_\ell\, N^{(1)}_\ell\big),$$

where $A_\ell$ represents the multi-head attention tensor, $N^{(1)}_\ell$ and $N^{(2)}_\ell$ are the two layer normalization tensors, $F_\ell$ is the FFN linearization tensor, and $I$ the identity. The Kronecker products and diagonalizations required to form these sub-tensors are derived explicitly for each operation, with the entire construction being local, i.e., functionally dependent on the specific input instance $X$ by using statistics (e.g., layernorm means, variances, activation slopes) observed on $X$.
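As a minimal numerical sketch (illustrative shapes and names, not the paper's implementation), the core claim that a frozen attention head acts linearly on the vectorized input can be checked directly via the Kronecker identity $\mathrm{vec}(AXW) = (W^{\top} \otimes A)\,\mathrm{vec}(X)$ (column-major vectorization):

```python
import numpy as np

rng = np.random.default_rng(0)
L, D = 4, 3                      # sequence length, hidden dimension
A = rng.standard_normal((L, L))  # frozen (post-softmax) attention matrix
W = rng.standard_normal((D, D))  # combined value/output projection
X = rng.standard_normal((L, D))  # input token embeddings

Y = A @ X @ W                    # one attention head's linear action on X

# Kronecker form: vec(A X W) = (W^T ⊗ A) vec(X), with column-major vec
T = np.kron(W.T, A)              # (L*D, L*D) unfolded block-tensor
vecY = T @ X.flatten(order="F")

assert np.allclose(vecY, Y.flatten(order="F"))
```

The `order="F"` (column-major) flattening matches the standard vectorization identity; a row-major convention would instead give $A \otimes W^{\top}$.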
2. Stepwise Derivation and Structural Intuition
The derivation exploits standard vectorization identities (e.g., $\mathrm{vec}(AXB) = (B^{\top} \otimes A)\,\mathrm{vec}(X)$) to recast all individual sub-layer computations into the form $\mathrm{vec}(Y_{\text{sub}}) = T_{\text{sub}}\,\mathrm{vec}(X)$, specifically:
- Self-attention: Combines token–token interactions (length–length) and feature–feature correlations (dimension–dimension) as a sum of Kronecker products $\sum_h (W^V_h W^O_h)^{\top} \otimes A_h$, where $A_h$ is the per-head attention matrix and $W^V_h, W^O_h$ are the value/output projections.
- Layer Normalization & FFN: Conditioned on fixed input $X$ (so the normalization statistics and activation slopes are frozen), both LayerNorm and FFN become data-dependent diagonal linear operators.
- Residuals: Incorporated via an additive identity; a sub-layer tensor $T$ is vectorized as $I + T$.
- Compositionality: Stacking all blocks yields a nested or concatenated product of the blockwise tensors $\mathcal{T}_\ell(X)$.
This linearization is locally faithful: by definition, $\mathcal{T}(X)$ is the exact Jacobian of the transformer's forward function, patched such that all nonlinearities (softmax, LayerNorm, activation slopes) are “frozen” at values computed on $X$.
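The "frozen nonlinearity" idea can be illustrated on LayerNorm alone (a sketch assuming per-token LayerNorm with cached scalar mean and variance; not the library's code): once the statistics are fixed at the reference input, the map is affine and its Jacobian is exactly diagonal.

```python
import numpy as np

rng = np.random.default_rng(1)
D = 5
x = rng.standard_normal(D)        # one token's activations
gamma = rng.standard_normal(D)    # LayerNorm scale parameter
eps = 1e-5

# statistics observed on the fixed input x, then frozen
mu, var = x.mean(), x.var()
sigma = np.sqrt(var + eps)

def ln_frozen(z):
    # LayerNorm with mean/variance frozen at the reference input x
    return gamma * (z - mu) / sigma

# with frozen statistics the map is affine; its Jacobian is diagonal
J = np.diag(gamma / sigma)

# finite-difference check of the diagonal Jacobian
h = 1e-6
J_fd = np.stack([(ln_frozen(x + h * e) - ln_frozen(x)) / h
                 for e in np.eye(D)], axis=1)
assert np.allclose(J, J_fd, atol=1e-4)
```

Without freezing, the mean and variance would themselves depend on the perturbation, producing off-diagonal Jacobian terms; freezing is what makes the per-token operator diagonal.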
3. Computational Construction and Example Pseudocode
To compute $\mathcal{T}(X)$ for a given input $X$, TensorLens uses automatic differentiation. The approach fixes nonlinearities (i.e., computes and freezes softmax weights, norms, and activation slopes at $X$) and computes the output’s total Jacobian with respect to the input:
```python
import torch

def compute_tensor(model, X):
    # 1) run a forward pass to cache all A_h, σ, φ′, etc.
    Y = model.forward_with_cache(X)
    # 2) define a linearized version that uses the cached A_h, φ′, σ
    def lin_model(X_):
        return model.linearized_forward(X_)  # returns (L, D)
    # 3) compute the Jacobian dY/dX: shape (L, D, L, D)
    T = torch.autograd.functional.jacobian(lin_model, X, create_graph=False)
    return T
```
4. Comparison to Previous Attention Aggregation Methodologies
TensorLens differs fundamentally from earlier aggregation schemes. The following table summarizes its relation to major prior approaches:
| Method | Included Components | Notable Omissions |
|---|---|---|
| Attn (head-averaging) | Head-averaged attention matrices per layer | Projections, residuals, FFN, LN |
| Rollout [Abnar & Zuidema] | Chained head-averaged attention across layers | As above |
| Value-weighted [Kobayashi] | Incorporates value/output projections | LayerNorm, residuals, FFN |
| W.AttnResLN [Kobayashi '21] | Residuals, first LN | FFN, second LN |
| GlobEnc [Modarressi '22] | Two LNs added | FFN |
| TensorLens | All linear ops, input/output embeddings, activations | — |
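For contrast with the table, the rollout row can be sketched in a few lines (an illustrative implementation under the commonly used conventions, not the original authors' code); note that it uses only head-averaged attention plus a crude identity term for residuals, omitting projections, FFN, and LayerNorm entirely:

```python
import numpy as np

rng = np.random.default_rng(2)
L, H, layers = 4, 2, 3

def rollout(attn_per_layer):
    """Attention rollout: average heads, add an identity term for the
    residual, renormalize rows, and chain the result across layers."""
    R = np.eye(L)
    for A_heads in attn_per_layer:        # A_heads: (H, L, L), rows sum to 1
        A = A_heads.mean(axis=0)          # head averaging
        A = 0.5 * A + 0.5 * np.eye(L)     # crude residual modelling
        A = A / A.sum(axis=-1, keepdims=True)
        R = A @ R                         # chain across layers
    return R

# random row-stochastic per-head attention matrices for each layer
attn = [rng.dirichlet(np.ones(L), size=(H, L)) for _ in range(layers)]
R = rollout(attn)
assert np.allclose(R.sum(axis=-1), 1.0)   # rows remain a distribution
```

Everything rollout discards (value/output projections, FFN, LayerNorm scaling) is exactly what the TensorLens block-tensors retain.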
Only TensorLens:
- Is fully principled, incorporating all linear operations, both LayerNorms, both FFN projections, activation slopes, residual adds, and embeddings.
- Is exact (first-order) at a given input $X$, as it is the literal Jacobian of the model’s patched forward function (local error bounded by Proposition 1).
- Offers flexible axis collapses to derive generalized or specialized attention maps that subsume previous variants.
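The "axis collapse" idea can be sketched on a random stand-in tensor (shapes only; the actual collapses in the paper contract against model embeddings and norms, and the names here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
L, D = 4, 3
T = rng.standard_normal((L, D, L, D))  # stand-in for the unfolded tensor T(X)

# "norm collapse": Frobenius norm over the two feature axes yields an
# L×L token-to-token influence map (one of many possible axis reductions)
M_norm = np.linalg.norm(T, axis=(1, 3))

# "per-class collapse": contract the output feature axis with a class
# direction w (hypothetical), then reduce the remaining input feature axis
w = rng.standard_normal(D)
M_class = np.linalg.norm(np.einsum("idkl,d->ikl", T, w), axis=-1)

assert M_norm.shape == (L, L) and M_class.shape == (L, L)
```

Different contractions of the same tensor reproduce different prior "extended attention" variants, which is the sense in which they are special cases.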
5. Empirical Evaluation and Applications
Extensive empirical tests demonstrate that TensorLens provides superior fidelity and interpretability compared to previous aggregation schemes.
- Perturbation Tests: On DeiT-Base/Small (ImageNet), TensorLens-based maps (“Tensor,Norm” and “Tensor,In+Out”) achieve AUC 0.66/0.82 (versus at most 0.60 for any non-tensor baseline). For BERT-family and Gemma3 models on IMDB, TensorLens achieves AUC 0.10/0.16 (versus 0.09 for non-tensor baselines). On decoder-only LLMs (Pythia-1B, Pico-570M, Phi-1.5 on WikiText-103), TensorLens ranks top-1 or top-2 on HS-MSE and AOPC metrics.
- Relation Decoding: Averaging per-example tensors yields a relation-specific linear map that matches or exceeds the Linear Relation Extraction (LRE) baseline on Pythia-1B, which considers only the embeddings.
- Interpretability and Visualization: By collapsing $\mathcal{T}(X)$ to attention maps (via norms, in+out embeddings, or per-class output projections), TensorLens recovers or extends attribution maps for input token importance.
- Manipulation and Distillation: $\mathcal{T}(X)$ (as the local linearization of the model at $X$) is directly usable for linear distillation (cf. “LoLCats” by Zhang et al. ’24). Model interventions can be effected by masking subtensors within $\mathcal{T}(X)$, with immediate re-evaluation of collapsed attention maps.
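Masking a subtensor and re-evaluating a collapsed map can be sketched as follows (a toy illustration on a random tensor, not the repository's memory-efficient implementation):

```python
import numpy as np

rng = np.random.default_rng(4)
L, D = 4, 3
T = rng.standard_normal((L, D, L, D))  # stand-in for T(X)

def collapse(T):
    # L×L norm-collapsed token-to-token map
    return np.linalg.norm(T, axis=(1, 3))

# ablate all influence flowing out of source token k = 2 by zeroing the
# corresponding subtensor, then re-evaluate the collapsed attention map
T_ablated = T.copy()
T_ablated[:, :, 2, :] = 0.0

M = collapse(T_ablated)
assert np.allclose(M[:, 2], 0.0)       # token 2 no longer contributes
```

The same slicing pattern extends to ablating individual neurons (feature indices) or projection subspaces rather than whole tokens.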
A memory-efficient Jacobian-slice implementation, as well as full code and worked examples, are available at https://github.com/idoatad/TensorLens.
6. Theoretical Guarantees and Scope
TensorLens is theoretically grounded, representing the first complete, input-dependent, high-order tensor formalization of a transformer’s global linear behavior. It encapsulates all prior “extended attention” proposals as strict special cases, achievable via particular axis reductions or omission of components. Proposition 1 in the source material provides local error bounds for the Jacobian approximation. The framework operates directly with input and output embeddings, and includes the capacity to trace, ablate, or visualize the influence of any model subcomponent within $\mathcal{T}(X)$ at the granularity of tokens, neurons, or projection subspaces.
A plausible implication is that TensorLens may serve as a foundational analytic tool for the next generation of mechanistic interpretability and model-editing methodologies, providing fine-grained, exact, and extensible representations of transformer computations.
Reference: [TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors, (Atad et al., 25 Jan 2026)]