Part II — Dataset audits (facts first, interpretation second)

II.1 Dataset A: FTLE heterogeneity Gλ

Dataset A — Gλ grid (NL_Glambda_trend.csv)

A.1.1 What the dataset actually contains (fact check)

From your description and usage:

This is internally consistent.

II.1.1 Experimental grid

We evaluate the FTLE field λT(x) on a fixed 2D grid and compute:

Gλ:=Varxμgrid[λT(x)].

Parameters:

II.1.2 Empirical facts

From the data:

  1. Width dependence
    For fixed L:
NGλ.
  1. Depth dependence at moderate/large width
    For fixed N50:
LGλ.
  1. Narrow-width anomaly (N=10)

These are purely observational statements.

II.1.3 Immediate but safe interpretation

Because Gλ measures variance of λT(x) over the grid:

No claim is made yet about feature learning.

II.2 Dataset B: Kernel and representation alignment (KA / RA)

Dataset B — RA / KA grid (RA_KA_NL_dataset.csv)

A.2.1 What the dataset actually contains

This is important: RA/KA measure total rotation over training, not instantaneous dynamics.

II.2.1 Definitions (reminder)

Kernel alignment:

KA=Kinit,KfinalF|Kinit|F|Kfinal|F.

Representation alignment:

RA=Zinit,ZfinalF|Zinit|F|Zfinal|F.

Both measure rotation away from initialization, not magnitude change.

II.2.2 Empirical facts

From the data:

  1. Width dependence
NRA,KA (toward 1).
  1. Depth dependence
LRA,KA,

with the strongest effect at small N.

  1. Wide-but-deep regime
RA,KA<1.

II.2.3 Meaning (strictly limited)

II.3 Cross-dataset summary (no synthesis yet)

From Part II alone:

This establishes a tension, not a contradiction:

This tension motivates the theory development in later parts.

Status after Part II (progress bar)