Mathematical tools from the abstract world of quantum fields have surprising relevance to the seemingly more concrete realm of particles in boxes.

来自量子场抽象世界的数学工具与看似更具体的盒中粒子领域有着惊人的关联.

To the uninitiated, the standard model of particle physics can seem like a random hodgepodge of particles and forces: quarks and gluons, charged leptons and neutrinos, W and Z bosons, each with its own idiosyncratic properties and behaviors. To paraphrase I. I. Rabi’s remark about the muon, “Who ordered that … or that … or that?” But a deeper dive into the theory reveals a method to the madness. Far from being arbitrary, many features of the model follow mathematically from the symmetries of the universe. Once the symmetries are known, much of the rest follows inevitably.

对于未接触过的人来说, 粒子物理标准模型似乎是一种随机的粒子和力的混合: 夸克和胶子, 带电轻子和中微子, W 和 Z 玻色子, 每种粒子都有其独特的特性和行为. 用 I. I. Rabi 关于 $\mu$ 子的话来说, “谁点的这个…或者那个…或者那个？“但对理论的深入研究揭示了这种疯狂背后的方法. 许多模型的特征并非任意的, 而是从宇宙的对称性中数学推导出来的. 一旦这些对称性被知晓, 其他的大部分内容就会不可避免地跟随而来.

The theoretical workhorse for deriving physical laws from symmetries is the gauge transformation. Roughly speaking, you start with a quantity, such as the phase of a quantum mechanical wavefunction, that doesn’t affect any physical observables, and you write it as a local function that takes different values at different points in space. Turn the mathematical crank, and out pops a physical law—in this case, a description of the existence and behavior of photons.

从对称性中推导物理定律的理论工具是规范变换. 粗略地说, 你从一个量开始, 比如量子力学波函数的相位, 它不会影响任何物理可观测量, 然后将其写成一个局部函数, 该函数在空间中的不同点取不同的值. 转动数学曲柄, 就会得到一个物理定律——在这种情况下, 是光子的存在和行为的描述.

Now, Matthias Schmidt and col leagues at the University of Bayreuth in Germany have shown that gauge transformations can also be fruitful in a seemingly disparate area of physics: statistical mechanics. They’re still exploring all the consequences of their discovery, but they’ve already uncovered a plethora of mathematical structure, equations that can help to characterize soft-matter systems, and questions about what statistical mechanical averages really mean.

现在, 德国拜罗伊特大学的 Matthias Schmidt 和他的同事们已经表明, 规范变换在看似不相关的物理领域——统计力学中也是富有成果的. 他们仍在探索他们发现的所有后果, 但他们已经发现了大量的数学结构, 可以帮助表征软物质系统的方程, 以及关于统计力学平均值到底意味着什么的问题.

Mindset shift

It all started with an offhand remark in 2019. Schmidt was working with Sophie Hermann, a new PhD student in his group, to explore the effect of a mathematical manipulation that he called “shifting.” “Sophie is a very clear and systematic thinker,” says Schmidt, “and she kept insisting that it was unclear what ‘shifting’ actually implies.”

Grasping for an answer, he appealed to a topic he’d covered in his undergraduate classical mechanics course, which Hermann had taken a few years previously: “Think of it like using Noether’s theorem,” he said. “Translational invariance in a given direction implies conservation of momentum in that direction.”

一切都始于 2019 年的一句随意的评论. Schmidt 正在与他的研究组中的新博士生 Sophie Hermann 合作, 探讨他称之为"移动"的数学操作的影响. Schmidt 说: “Sophie 是一个非常清晰和系统性思考者, 她一直坚持认为不清楚‘移动’实际意味着什么. "

为了找到答案, 他提到了他在本科经典力学课程中涉及的一个主题, 而 Hermann 几年前就已经学过了: “想象一下使用 Noether 定理, “他说. “在给定方向上的平移不变性意味着在该方向上动量守恒. "

Schmidt was referring to Emmy Noether, the foremother of modern thinking about the role of symmetry in physics. With her theorem, published in 1918, she proved that whenever a system is invariant under a continuous symmetry, it has a corresponding conserved quantity. Translational symmetry implies conservation of momentum, rotational symmetry implies conservation of angular momentum, and timetranslation symmetry implies conservation of energy.

Schmidt 指的是 Emmy Noether, 现代物理中关于对称性作用的思考的先驱. 在 1918 年发表的定理中, 她证明了每当一个系统在连续对称性下不变时, 它就有一个相应的守恒量. 平移对称性意味着动量守恒, 旋转对称性意味着角动量守恒, 时间平移对称性意味着能量守恒.

Those undergraduate-friendly examples might seem pedestrian and hardly worth mentioning, but the theorem’s implications go far deeper. Noether herself was drawn to the problem by the desire to reconcile what physicists thought they knew about classical mechanics with the new theories of special and general relativity. A relativistic universe—especially if it’s expanding—might not be translationally or time-translationally invariant, so it might not conserve momentum and energy. But it has other symmetries, and thus other conserved quantities. Noether laid the foundations for understanding it all.

这些本科生友好的例子可能看起来很平凡, 几乎不值一提, 但定理的含义更深远. Noether 本人之所以对这个问题感兴趣, 是因为她希望通过将物理学家认为他们对经典力学的了解与特殊相对论和广义相对论的新理论相协调. 一个相对论的宇宙——特别是如果它在膨胀——可能不是平移或时间平移不变的, 因此它可能不会保持动量和能量. 但它有其他的对称性, 因此有其他的守恒量. Noether 奠定了理解这一切的基础.

“It was meant to be a throwaway comment,” says Schmidt. “What I hadn’t expected was that Sophie would go back to Noether’s original paper, work through it, and come back with the conclusion that the idea actually has some real substance in it. Once that was clear, we just sat down and worked it out as clearly as we could.”

To start with a simple example, they considered shifting the position r of each particle in an ensemble by a constant vector ε. That’s not inherently a symmetry of the underlying classical mechanical system, because they envisioned the particles moving in an external energy potential $V(\mathbf{r})$ that stays put under the shift. So when the particles’ positions change, their energies do too. But when the researchers looked at the system as a statistical mechanical ensemble, something more subtle happened.

Schmidt 说: “这是一个随意的评论. 我没有预料到 Sophie 会回到 Noether 的原始论文, 仔细研究它, 并得出结论, 这个想法实际上有一些真实的内容. 一旦这一点清楚了, 我们就坐下来尽可能清楚地解决这个问题. "

为了从一个简单的例子开始, 他们考虑了将集合中每个粒子的位置 $\mathbf{r}$ 移动一个常矢量 $\varepsilon$. 这并不是基础经典力学系统的对称性, 因为他们设想粒子在一个保持不变的外部能量势 $V(\mathbf{r})$ 中运动. 因此, 当粒子的位置发生变化时, 它们的能量也会发生变化. 但当研究人员将系统视为统计力学集合时, 发生了更微妙的事情.

The basic operation of equilibrium statistical mechanics is the computation of weighted averages by integrating over all possible arrangements of individual particles, with each arrangement, or “microstate,” weighted by $e^{−E/kT}$, in which E is the total energy, $T$ is the temperature, and $k$ is Boltzmann’s constant. The weighting reflects the fact that lowenergy microstates always show up with the highest probability, but the higher-energy ones are not ruled out, especially at higher temperatures.

平衡态统计力学的基本操作是通过对所有可能的单个粒子排列进行积分来计算加权平均值, 每个排列或"微观状态"都由 $e^{−E/kT}$ 加权, 其中 $E$ 是总能量, $T$ 是温度, $k$ 是玻尔兹曼常数. 加权反映了低能量微观状态总是以最高的概率出现, 但更高能量的微观状态并未被排除, 特别是在较高的温度下.

Shifting a microstate changes its energy and, therefore, its weight in the average. As a result, it turns out, the equilibrium average—of any observable quantity—is unaffected by the shift. Shifting by ε is not a symmetry under classical mechanics, but under statistical mechanics, it is.

Mathematically, the symmetry means that in thermal equilibrium, the derivative $\mathrm{d}X/\mathrm{d}\varepsilon = 0$, no matter what $X$ is. Hermann and Schmidt took $X$ to be $\sum V$, the sum of the total potential energy of all particles—and the derivative of potential energy is just the force exerted by that potential. Ergo, in equilibrium, $\sum F_{\text{ext}}$, the sum of external forces on the system, equals zero.

移动一个微观状态会改变其能量, 因此也会改变其在平均值中的权重. 结果表明, 任何可观测量的平衡平均值——不受移动的影响. 通过 $\varepsilon$ 移动在经典力学下不是一个对称性, 但在统计力学下是.

从数学上讲, 这种对称性意味着(系统)在热平衡状态下, 无论 $X$ 是什么, 导数 $\mathrm{d}X/\mathrm{d}\varepsilon = 0$. Hermann 和 Schmidt 将 $X$ 取为 $\sum V$, 所有粒子的总势能之和——势能的导数就是该势能施加的力. 因此, 在平衡状态下, 系统上外力的总和 $\sum F_{\text{ext}}$ 等于零.

That might seem boringly obvious. If $\sum F_{\text{ext}}$ were not zero, the system would start to move, which would mean it hadn’t been in equilibrium after all. But as the researchers pointed out, $\sum F_{\text{ext}}=0$ is not true for most of the individual microstates. Rather, it’s a nontrivial statement about the nature of thermal equilibriuma so-called sum rule—and Noether’s theorem offered a new way of proving it.

这可能看起来显而易见. 如果 $\sum F_{\text{ext}}$ 不为零, 系统将开始运动, 这意味着它根本不处于平衡状态. 但正如研究人员指出的那样, $\sum F_{\text{ext}}=0$ 对于大多数单个微观状态来说并不成立. 相反, 这是关于热平衡性质的一个非平凡陈述——所谓的求和规则——而 Noether 定理提供了一种新的证明方法.

From global to local

With subsequent waves of group members over the past five years—including Florian Sammüller and Johanna MüllerSchmidt and Hermann continued to develop the theory. In particular, says Schmidt, “Noether’s theorem comes in two flavors, local and global. We’d started with global shifts, but the real powerhouse is the local version.”

Generalizing from global to local symmetry would mean changing ε from a single vector to a position-dependent function $\varepsilon(\mathbf{r})$. In general, shifting by $\varepsilon(\mathbf{r})$ is not a statistical mechanical symmetry: The shift spreads out some particles and moves others closer together. The distortion leaves the respective microstates either overrepresented or underrepresented in the integral over all microstates.

在过去的五年里, 随着一波波的团队成员——包括 Florian Sammüller 和 Johanna MüllerSchmidt 和 Hermann 继续发展这一理论. Schmidt 说: “Noether 定理有两种版本, 局部和全局. 我们从全局移动开始, 但真正的强大之处在于局部版本. "

从全局对称性推广到局部对称性意味着将 $\varepsilon$ 从一个单一矢量变为一个位置相关函数 $\varepsilon(\mathbf{r})$. 一般来说, 通过 $\varepsilon(\mathbf{r})$ 移动不是一个统计力学对称性: 移动会使一些粒子分散开来, 而使另一些粒子靠近. 这种扭曲会使各自的微观状态在所有微观状态的积分中要么过表示, 要么欠表示.

But the thermal average is an integral over not just the particles’ positions but also their momenta. So the Bayreuth researchers introduced a corresponding momentum transform, as shown in figure 1, that compensated for the effect of the position shift: Where $\varepsilon(\mathbf{r})$ spread the positions apart, the momenta were correspondingly compressed, and vice versa. As a result, the position–momentum shift (still referred to as simply $\varepsilon$ for brevity) once again left the equilibrium averages of all observables unchanged.

Unraveling the consequences of the local symmetry follows similar lines. Now, $\mathrm{d}X/\mathrm{d}\varepsilon(\mathbf{r})$ is what’s called a functional derivative—a derivative with respect to a function—but just like an ordinary derivative with respect to a number, it can still be set to zero for any observable $X$. Moreover, one can study the second derivatives with respect to $\varepsilon$ to generate higher-order sum rules that relate the spatial correlations among forces and other quantities.

但热平均值是对粒子的位置和动量的积分. 因此, 拜罗伊特的研究人员引入了一个相应的动量变换, 如图 1 所示, 以补偿位置移动的影响: 当 $\varepsilon(\mathbf{r})$ 使位置分开时, 动量相应地被压缩, 反之亦然. 结果, 位置-动量移动（为简洁起见仍然简称为 $\varepsilon$）再次使所有可观测量的平衡平均值保持不变.

解开局部对称性的后果遵循类似的线路. 现在, $\mathrm{d}X/\mathrm{d}\varepsilon(\mathbf{r})$ 是所谓的泛函导数——对函数的导数——但就像对数的普通导数一样, 它仍然可以对任何可观测量 $X$ 设置为零. 此外, 可以研究关于 $\varepsilon$ 的二阶导数, 以生成关于力和其他量之间的空间相关性的高阶求和规则.

“All these sum rules just say, ‘Zero equals zero,’ " says Schmidt, “or ‘These two things add up to zero,’ where one is an obvious everyday object, and the other is some strange correlation function that you’d never otherwise think of measuring. But it’s really worth it to study them, because they can tell you a lot about the system you’re looking at.”

For example, for their first foray into exploring the consequences of the local symmetry, they looked at simulations of liquids and gels. Those two forms of matter have obvious differences on the macroscale, but it can be tricky to relate their properties to what’s happening on the microscale. “The natural thing to want to measure is the shell structure–how likely particles are to be some distance apart,” says Sammüller. That quantity, plotted as $g$ in the upper panels of figure 2, is qualitatively similar between model liquids and model gels.

Schmidt 说: “所有这些求和规则都只是说, ‘零等于零’, 或者‘这两个东西加起来等于零’, 其中一个是一个明显的日常对象, 另一个是一些奇怪的相关函数, 你从未想过要测量. 但研究它们真的很值得, 因为它们可以告诉你很多关于你正在研究的系统的信息. "

例如, 对于他们首次探索局部对称性后果的尝试, 他们观察了液体和凝胶的模拟. 这两种物质在宏观尺度上有明显的差异, 但将它们的性质与微观尺度上发生的事情联系起来可能会有些棘手. Sammüller 说: “想要测量的自然事物是壳结构——粒子之间相隔多远的可能性. " 这个量, 作为图 2 上子图中的 $g$ 绘制, 模型液体和模型凝胶之间在定性上是相似的.

But when the researchers differentiated energy twice with respect to $\varepsilon(\mathbf{r})$, they got a sum rule relating derivatives of $g$ to correlations of forces $g_{ff}$ and force gradients $g_{\nabla f}$. The correlations would be hard to measure in real fluids, but they’re certainly measurable in simulations and possibly even in experiments on micron-sized colloids. And as the bottom panels show, they’re starkly different between liquids and gels, and the quantities that the sum rule predicts to be equal really are.

“These quantities that come out of the analysis can be very sensitive to various important physical mechanisms,” says Sammüller. “They might even be useful for designing liquids with tailored properties.”

但当研究人员对能量关于 $\varepsilon(\mathbf{r})$ 两次求导时, 他们得到了一个关于 $g$ 的导数与力的相关性 $g_{ff}$ 和力梯度 $g_{\nabla f}$ 的求和规则. 这些相关性在真实流体中很难测量, 但它们在模拟中肯定是可测量的, 甚至在微米级胶体的实验中也是可能的. 正如底部面板所示, 它们在液体和凝胶之间有着明显的不同, 而求和规则预测相等的量确实是相等的.

Full circle

“We could have continued like this,” says Schmidt, “with a new paper for every observable: ‘Now we can do this for energy, now for kinetic energy,’ and so on.” But when Müller joined the group, she brought with her a master’s degree in mathematics—and the tools to show just how general the shifting theory really was.

The universe of all possible shifts $\varepsilon(\mathrm{r})$, it turned out, forms a mathematical structure called a Lie algebra (named after Norwegian mathematician Sophus Lie—nothing to do with prevarication). Lie algebras turn up in many other areas of physics and mathematics, including in the gauge transformations from particle physics. “Dealing with gauge invariance and Lie algebras is such a standard thing in other areas,” says Schmidt, “and it helps us to better understand, assess, and manage the implications of the mathematics.”

Schmidt 说: “我们本可以继续这样下去, 对每个可观测量写一篇新论文: ‘现在我们可以对能量做这个, 现在对动能做这个’, 等等. " 但当 Müller 加入团队时, 她带来了数学硕士学位——以及展示移动理论实际上有多么普遍的工具.

所有可能移动 $\varepsilon(\mathrm{r})$ 的宇宙, 原来形成了一个称为李代数的数学结构（以挪威数学家 Sophus Lie 命名——与说谎(lie)无关）. 李代数出现在许多其他物理和数学领域, 包括粒子物理中的规范变换. Schmidt 说: “在其他领域处理规范不变性和李代数是一件很标准的事情, 它帮助我们更好地理解、评估和管理数学的含义. "

In particular, the Lie algebra structure sets clear boundaries on the types of sum rules that the $\varepsilon(\mathrm{r})$ shifts can generate. No matter what observable quantity the researchers start with or which functional derivatives they calculate, they’ll end up with a sum rule involving correlations of forces and other specific force-like quantities. “These do form a hierarchy of increasing complexity, but the complexity is within the limits set by the Lie algebra,” says Schmidt. “The sum rules don’t proliferate into an uncontrolled, everincreasing range of quantities that they relate to each other.”

The implications of the $\varepsilon(\mathrm{r})$ shifts were falling into place, but there remained the matter of Hermann’s original question: What does the shift really mean? “Gauge invariance is a brutal thing somehow,” says Schmidt, “because it says that all these things that one can reach with the gauge transformation are really the same.” That is, the gauge transformation is more than a mere mathematical manipulation: The transformed and untransformed versions of the system are physically indistinguishable, which means they’re also physically equivalent.

特别是, 李代数结构为 $\varepsilon(\mathrm{r})$ 移动可以生成的求和规则类型设定了明确的边界. 无论研究人员从哪个可观测量开始, 或者计算哪些泛函导数, 他们最终都会得到一个涉及力的相关性和其他特定类似力的数量的求和规则. Schmidt 说: “这确实形成了一个逐渐复杂的层次结构, 但这种复杂性是在李代数设定的限制范围内的. 求和规则不会无限制地扩散到与彼此相关的数量的不断增加的范围. "

$\varepsilon(\mathrm{r})$ 移动的含义正在逐渐清晰, 但仍然存在 Hermann 最初的问题: 移动到底意味着什么？Schmidt 说: “规范不变性在某种程度上是一种残酷的事情, 因为它说所有这些可以通过规范变换达到的东西实际上是相同的. “也就是说, 规范变换不仅仅是一种简单的数学操作: 系统的变换和未变换版本在物理上是无法区分的, 这意味着它们在物理上是等价的.

Other common targets of gauge transformations, such as quantum fields and electromagnetic potentials, are already such abstract entities that it’s relatively easy to accept that one way of writing them down is no more physically real than any other. Statistical mechanics seems different in that regard, because classical intuition gives rise to mental movies of ensembles of particles zipping around in boxes. Those microstates might seem too concrete to exist as part of an $\varepsilon(\mathrm{r})$-shifted equivalence class: Surely one set of positions and momenta must be the real one?

“It’s absolutely weird that gauge invariance also applies in this context, and it’s hard to get your head around,” says Schmidt. “But it’s the averages we’re taking that are the abstract thing. The movies aren’t real—they’re just one very specific illustration. It’s possible to look at a system too accurately.”

度规变换的其他常见目标, 如量子场和电磁势, 已经是如此抽象的实体, 以至于人们相对容易接受这样一种写法并不比其他写法在物理上更真实. 统计力学在这方面似乎有所不同, 因为经典的直觉会让人在脑海中浮现出在盒子里飞来飞去的粒子集合体. 这些微观状态似乎过于具体, 不可能作为一个 $\varepsilon(\mathrm{r})$ 移位等价类的一部分而存在: 肯定有一组位置和力矩是真实的吗？

Mindset shift#

From global to local#

Full circle#

Mindset shift

From global to local

Full circle