Second-order theory as a square: what you gain, what you lose
- Definitions / Notation used
- Square-root logic
- One technical lemma (structure of the second-order equation)
- What you gain by squaring
- What you lose by squaring
- Assumptions vs consequences
- Why this matters
- Key takeaway
- Technical takeaway
First-order equations are incisive: they define a tight solution space and often come with good deformation theory. They are also unforgiving: small changes in modeling choices (boundary conditions, gauge-fixing, numerical discretization) can push you off the solution set in a way that is hard to control. Squaring the first-order object $\Upsilon_\omega$ is the standard move that trades “sharp constraints” for “energy-like control.”
Definitions / Notation used
- $\Upsilon_\omega := \bullet _\varepsilon(F_B) − \kappa_1 T$ is the first-order field object on $Y$ ($\mathrm{ad}(P_H)$-valued).
- Define the squared action:
$$ I_2(\omega) := \int_Y ⟨\Upsilon_\omega, \ast_Y \Upsilon_\omega⟩. $$
Square-root logic
There are two logically distinct statements:
1) If $\Upsilon_\omega = 0$, then $\omega$ is a stationary point of $I_2$.
This is immediate: the integrand is quadratic in $\Upsilon_\omega$, so if $\Upsilon_\omega$ vanishes pointwise, any first variation of $I_2$ vanishes.
2) If $\omega$ is a stationary point of $I_2$, then $\Upsilon_\omega = 0$.
This is false in general. Stationary points of a square include $\Upsilon_\omega = 0$ solutions, but can also include configurations where $\Upsilon_\omega$ is nonzero yet satisfies the second-order Euler–Lagrange equation (a covariant “divergence-free” condition). This is the precise sense in which “first-order implies second-order,” but not conversely.
So the square-root slogan here is not mystical: it is a strict inclusion of solution sets: ${\Upsilon_\omega = 0} ⊂ {EL(I_2) = 0}$.
One technical lemma (structure of the second-order equation)
Lemma (Second-order Euler–Lagrange has the form “adjoint derivative of $\Upsilon$”).
Under variations of $\omega$ that respect the boundary conditions, the first variation of $I_2$ can be written schematically as
$$ \delta I_2(\omega) = 2 \int_Y \langle \delta \omega, 𝓛_\omega^†(\Upsilon_\omega)\rangle, $$
so the Euler–Lagrange equation for $I_2$ is
$$ 𝓛_\omega^†(\Upsilon_\omega) = 0, $$
where $𝓛_\omega$ is the linearization of the map $\omega ↦ \Upsilon_\omega$ (hence depends on $A_0$, $\varepsilon$, the Shiab operator, and the $\sigma$-split metric through $\ast_Y$, and $𝓛_\omega^†$ is its formal adjoint with respect to the $⟨·,·⟩/\ast_Y$ pairing.
Proof sketch.
$I_2 = \int ⟨\Upsilon, \ast_Y \Upsilon⟩$. Varying gives $\delta I_2 = 2 \int ⟨\delta \Upsilon, \ast_Y \Upsilon⟩$. But $\delta \Upsilon = 𝓛_\omega (\delta\omega)$ by definition of the linearization. Move $𝓛_\omega$ off $\delta\omega$ by adjunction to obtain the displayed form (plus boundary terms that vanish under the assumed support/decay or imposed boundary conditions). No componentwise “Ricci tracing” occurs: everything is packaged in the covariant linearization of $\Upsilon$.
What you gain by squaring
1) A functional that is naturally suited to numerics.
Even in split signature, $I_2$ is the canonical “least-squares” objective: it measures failure to satisfy the first-order equation. This is exactly the structure you want if you are doing continuation methods, Newton–Krylov solvers, or constrained minimization.
2) A direct bridge to EFT thinking.
Expanding $I_2$ around a background solution $\omega_0$ gives a quadratic form governed by the linearized operator $𝓛_{\omega_0}$. That is the entry point to propagators, effective operators, and mode suppression.
3) A cleaner path to quantization heuristics (without claiming success yet).
Path integrals over $\omega$ weighted by $exp(−I_2)$ (or its Lorentzian analogue) are the standard story. In a split-signature ambient space, the real work is to identify the correct involution/projection that yields a well-behaved quadratic form on the propagating sector. Squaring is necessary, not sufficient, but it is the move that makes the question well-posed.
What you lose by squaring
1) You enlarge the solution space.
The first-order equation $\Upsilon_\omega=0$ is a strong geometric constraint. The second-order equation $𝓛_\omega^†(\Upsilon_\omega)=0$ allows “harmonic” $\Upsilon_\omega$ configurations: nonzero, but divergence-free in the appropriate covariant sense. Whether those extra branches are physically relevant, gauge artifacts, or pathological depends on boundary conditions and the sector ($E$-block vs everything). You do not get to ignore this.
2) You obscure the geometric meaning.
$\Upsilon_\omega=0$ is a direct balance law: Shiab-contracted curvature equals $\kappa_1$ times torsion. The second-order equation reads like “a differential operator applied to that balance law vanishes.” That is less interpretable. It is not worse; it is just further from the conceptual anchor.
3) You inherit the ambient signature problem in a sharper form.
On a $(7,7)$ manifold, inner products are not automatically positive. If you want $I_2$ to behave like an energy, you must specify the pairing/involution that selects the physical sector (and, in our instantiation, you will do that through the gravitational block $E$ and the pullback-visible modes). Until that is spelled out, any positivity language is propaganda. Here we keep it neutral: $I_2$ is the natural square; its analytic character depends on the sector.
Assumptions vs consequences
Assumptions:
- Same geometric and gauge setup as before ($\mathrm{Spin}(7,7)$, $\sigma$-split, $A_0$ fixed, $\bullet_\varepsilon$ fixed via $E/\Theta_E$, $D\Theta_E=0$).
- Boundary conditions that kill integration-by-parts terms (compact $Y$, or decay, or explicit boundary term choices).
- A specified adjoint/inner product structure for defining $𝓛_\omega^†$ (this is where split signature matters).
Consequences:
• First-order solutions $\Upsilon_\omega=0$ are automatically solutions of the second-order EL equation. • Second-order solutions include (possibly many) additional branches with $\Upsilon_\omega \neq 0$ but $𝓛_\omega^†(\Upsilon_\omega)=0$. • The squared action is the right object for perturbation theory and numerical “residual minimization,” but it does not replace the conceptual primacy of the first-order equation.
Why this matters
If the project is going to produce an EFT corner, a numerical fitting program, or any credible discussion of fluctuations, you will end up linearizing something and controlling error norms. $I_2$ is that control functional. The first-order equation is the geometric statement; the square is the engineering interface. Keeping both, and being explicit about what each one buys you, is how we avoid overpromising.
Key takeaway
Squaring gives you a robust second-order theory whose solutions include all first-order solutions, but also potentially more.
Technical takeaway
$I_2(\omega) = \int_Y \langle \Upsilon_\omega, \ast_Y Υ_\omega⟩$, with $\Upsilon_\omega = 0 ⇒ EL(I_2): 𝓛_\omega^†(\Upsilon_\omega)=0$, but not conversely.
Loading comments…