DYNAMICAL SYSTEMS IN PURE MATHEMATICS A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Max Lipton May 2023 © 2023 Max Lipton ALL RIGHTS RESERVED DYNAMICAL SYSTEMS IN PURE MATHEMATICS Max Lipton, Ph.D. Cornell University 2023 The author’s research program involves several topics which differ at first glance. However, they all share the common theme of exploring how geometry and topology influences dynamical equilibria. The dissertation is broken into three parts: the hyper- bolic geometry of higher-dimensional Kuramoto oscillators, electrostatic knot theory, and minimal surfaces with Möbius energy on the boundary. Each part is further di- vided into chapters adapting the author’s preprints and published papers, which have appeared in journals in physics, applied mathematics, and pure mathematics. BIOGRAPHICAL SKETCH Max Lipton was born in Salem, Oregon on June 18, 1994 as the only child to Stephen and Mythuan “Mitoo” Lipton. His father was a public criminal defense attorney, and his mother was a refugee of the Vietnam War who later became a certified medical technician. After graduating from South Salem High School, Max attended Willamette University and earned a B.A., double majoring in mathematics and computer science. During his junior year, Max spent a semester abroad in Russia, where he participated in the Math in Moscow exchange program. Despite having visited every state in the US as a teenager, except for Alaska and Hawaii, this trip was the first time he left North America. He is about to graduate with a Ph.D. in mathematics from Cornell University, ad- vised by Steve Strogatz. After a “wrestling match” unlike anything his advisor has ever seen in his thirty years of advising, Max was awarded an NSF Mathematical Sci- ences Postdoctoral Research Fellowship at the Massachusetts Institute of Technology. He is set to marry his beloved Courtney Elizabeth on October 21, 2023. iii To Courtney. iv ACKNOWLEDGEMENTS There are so many people who have contributed to my mathematical development. Everywhere I have been, I had the pleasure of meeting many excellent teachers, men- tors, and friends who helped me in big ways and small. I owe all of them my deepest gratitude. In particular, without the opportunity and support provided by the late Bob Strichartz and his Analysis on Fractals REU, I would never have made it to Cor- nell. I’m also incredibly grateful for the support of my thesis advisor Steve Strogatz, who I am proud to also call my friend. I would also like to give special thanks to the following people who, through one way or another, left a lasting impact on my in- tellectual and personal development: Wayne Atwood, Warren Trotter, Inga Johnson, Colin Starr, Fritz Ruehr, Charleen Gust, Gill Grindstaff, Aaron Calderon, Max Hall- gren, Kelly Delp, Alex Townsend, Rennie Mirollo, Gokul Nair, and Xin Zhou. The secret to life is enjoying the company of those you meet along the way. I would also be remiss if I didn’t acknowledge the monumental support given from my mom, dad, and Courtney. And last, but not least, I would like to give a very special shout-out to the cats who have provided me with much love, support, and mathematical insight: Latitude, Gratitude, Amplitude “Ampi”, Plus, Minus, Equals, and Infinity-Algebra “Fifi.” (My parents came up with the names). Every meow is special. v TABLE OF CONTENTS Biographical Sketch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii Dedication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v Table of Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi 1 Introduction 1 I The hyperbolic geometry of higher-dimensional Kuramoto os- cillators 4 2 The Kuramoto model on a sphere 5 2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.2 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.2.1 The Kuramoto Model on a Sphere . . . . . . . . . . . . . . . . . . 9 2.3 Reduction of system parameters . . . . . . . . . . . . . . . . . . . . . . . 10 2.4 Hyperbolic geometry and Möbius transformations . . . . . . . . . . . . . 13 2.5 Infinitesimal generators . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.6 Comparison of Z versus W coordinates . . . . . . . . . . . . . . . . . . . 17 2.7 Continuum limit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 2.8 Complex case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 2.9 Relation to previous research . . . . . . . . . . . . . . . . . . . . . . . . . 24 2.10 An example: First-order linear order parameter gives gradient system . 26 2.10.1 Computer visualization . . . . . . . . . . . . . . . . . . . . . . . . 27 2.10.2 Existence of hyperbolic gradient . . . . . . . . . . . . . . . . . . . 29 2.10.3 Analysis of dynamics . . . . . . . . . . . . . . . . . . . . . . . . . 31 II Electrostatic knot theory 39 3 A lower bound on the critical points of the electric potential of a knot 40 3.1 Preliminaries and problem formulation . . . . . . . . . . . . . . . . . . . 40 3.2 Preliminary definitions and lemmas . . . . . . . . . . . . . . . . . . . . . 41 3.3 Proof of the tunnel number bound . . . . . . . . . . . . . . . . . . . . . . 45 3.3.1 Construction of the tunneling . . . . . . . . . . . . . . . . . . . . . 45 3.3.2 Constructing a smooth boundary around the tunneling . . . . . . 48 3.3.3 The deformation retraction to a handlebody . . . . . . . . . . . . 51 4 Further topological approaches to knotted electric charge distributions 53 4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 4.2 Bifurcations through degenerate critical points . . . . . . . . . . . . . . . 58 4.3 A partial sharpening of the tunnel number bound . . . . . . . . . . . . . 61 vi 4.4 The relationship between the critical set and the Morse code of equipo- tential surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 4.5 Concluding Remarks and Future Directions . . . . . . . . . . . . . . . . . 70 5 Specific charge distributions and perspectives in physics 71 5.1 Motivations from physics . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 5.2 Charged loops in the plane: Specific examples . . . . . . . . . . . . . . . 72 5.2.1 Rectangle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 5.2.2 Stadium . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 5.2.3 Ellipse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 5.3 Charged Knots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 5.3.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 5.3.2 Physical intuition about equipotential surfaces and equilibrium points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 5.3.3 Numerical methods . . . . . . . . . . . . . . . . . . . . . . . . . . 82 5.3.4 Results and conjectures for charged knots . . . . . . . . . . . . . 84 5.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 III Plateau problems with Möbius energy on the boundary 90 6 Minimizers of the Möbius-Plateau energy 91 6.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 6.2 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 6.3 Energy Minimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 6.3.1 Lower Semicontinuity of EP . . . . . . . . . . . . . . . . . . . . . . 96 6.3.2 Equicontinuity at the Boundary . . . . . . . . . . . . . . . . . . . . 97 6.3.3 Convergence in the Interior . . . . . . . . . . . . . . . . . . . . . . 100 6.4 Möbius-Plateau Stationary Helix Pairs . . . . . . . . . . . . . . . . . . . . 101 7 Stationary helix pairs and complex asymptotics of the Möbius gradient 115 7.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 7.2 Möbius-stationary symmetric double helices . . . . . . . . . . . . . . . . 118 7.3 Möbius-Plateau stationary symmetric screws . . . . . . . . . . . . . . . . 128 Bibliography 132 vii CHAPTER 1 INTRODUCTION When everything is said and done Still looking for answers, if only one Turn my back, the urge has gone Left with no reason, we come undone Rob Swire, The Island Dynamical systems are differential equations. Differential equations are vector fields. And vector fields are topology. This perspective is mathematically reduc- tive, but it is an essential scientific heuristic for building bridges between disciplines whose respective research communities are generally disjoint. My dissertation strad- dles several different fields in pure and applied mathematics, but the common thread is exploring how topology and geometry influence dynamical equilibria. Employing techniques from areas including knot theory, calculus of variations, and dynamical systems, I seek to build new bridges between disciplines. I also implement numerical simulation and computer visualization to assist in the formulation of new conjectures which are then proven rigorously. Each part and chapter stands independently, and is adapted from the preprints and publications I’ve contributed to throughout my grad- uate career. I have obtained what I hope are considered to be significant milestone results, but by no means are any of these lines of inquiry complete. I believe all three topics are manifestations of deep mathematical phenomena yet to be unraveled. The first part is about the hyperbolic geometry of higher-dimensional Kuramoto oscillators. Kuramoto dynamics describe systems of coupled particles on S 1 which show long-term synchronization behavior under generic initial conditions. The re- sults of Watanabe-Strogatz yield constants of motion which greatly reduce the sys- 1 tem’s dimension [128]. Later work by Marvel-Mirollo-Strogatz, Chen-Engelbrecht- Mirollo, and Lohe [20, 74, 80] show these constants of motion have an interpretation in terms of hyperbolic geometry. A paper I co-authored with Mirollo and Strogatz deals with higher-dimensional oscillators living on S d rather than S 1 [69]. We used the generalized Watanabe-Strogatz transform to reduce the dynamics to the Lie group of hyperbolic isometries of the sphere’s Poincaré ball interior, whilst giving the condi- tions for a linearly coupled Kuramoto system to synchronize. The second part concerns electrostatic knot theory, which investigates the poten- tial induced by a knotted curve with uniform electric charge distributed on it. The overarching objective is to find relationships between classical knot invariants and the dynamical properties of the electric field on the complement. One particular result I proved so far is a lower bound on the potential’s critical set based on the tunnel number [66] using methods from Morse theory and geometric topology. A recent pa- per I co-authored with Strogatz and Townsend expands this result by analyzing some explicitly given knot parametrizations [71]. The third and final part discusses Plateau problems with Möbius boundary en- ergy, a class of problems in the calculus of variations seeking to minimize novel con- vex combinations of two well-known energies associated with a curve: its Möbius energy and the area of its minimally spanning surface. A seminal paper of Freedman- He-Wang proves several nice properties of the Möbius energy, including regularity of minimizers and invariance under Möbius transformations [34]. In a paper co- authored with Nair [70], we proved the existence of minimizers of our novel Möbius- Plateau energy. In future work, Nair and I hope to use this result to deduce boundary self-avoidance in other geometric optimization settings, particularly the Euler-Plateau problems from elasticity theory. To add concrete examples to our theory, I searched 2 for critical helicoidal strips and proved a characterization of these strips via hard esti- mates of a certain oscillatory integral in the Euler-Lagrange equation [70] in addition to complex asymptotic methods [67]. 3 Part I The hyperbolic geometry of higher-dimensional Kuramoto oscillators 4 CHAPTER 2 THE KURAMOTO MODEL ON A SPHERE This chapter originally appeared as “The Kuramoto model on a sphere: Explaining its low-dimensional dynamics with group theory and hyperbolic geometry” by Lipton, Mirollo, and Strogatz in Chaos Vol. 31, Iss. 9 [69]. 2.1 Introduction In 1975, Kuramoto introduced a model for a large population of coupled oscilla- tors with randomly distributed natural frequencies. [60] Kuramoto’s model displayed many remarkable features: It was exactly solvable (at least in some sense), despite being nonlinear and infinite-dimensional. [61] Its solution shed analytical light on a phase transition to mutual synchronization that Winfree had previously discovered in a similar but less convenient system of oscillators. [132, 133] Since then, the Kuramoto model has been an object of fascination for nonlinear dynamicists, as well as a sim- plified model for many real-world instances of coupled oscillators in physics, biology, chemistry, and engineering. [120, 102, 119, 1, 30, 101, 108, 10] From a mathematical standpoint, one of the most intriguing problems has been to explain the tractability of the Kuramoto model. What symmetry or other hidden structure accounts for its solvability? The first clues came from work on an adjacent topic: series arrays of N identi- cal overdamped Josephson junctions. The governing equations for those supercon- ducting oscillators are closely related to the equations of the Kuramoto model [130, 131], and themselves displayed remarkable dynamical features, such as surprisingly low-dimensional invariant tori[125, 122] and ubiquitous neutral stability of splay 5 states[87], despite the presence of damping and driving in the governing equations. These features were explained in 1993 by the discovery of a certain change of variables, now called the Watanabe-Strogatz transformation [127, 128], which showed that the governing equations have N − 3 constants of motion for all N ≥ 3. Goebel [38] then pointed out that the Watanabe-Strogatz transformation could be viewed as a time- dependent version of a linear fractional transformation, a standard tool in complex analysis. For more than a decade, however, these results did not attract much atten- tion, perhaps because they were assumed to be restricted to problems about Josephson junctions, and within that specialized setting, even further restricted to junctions that were strictly identical. A breakthrough occurred in 2008 with the work of Ott and Antonsen. [96, 97] They found an astonishing way to capture the macroscopic dynamics of the infinite-N Kuramoto model, even when the oscillators’ frequencies were non-identical and ran- domly distributed. First, they wrote down an ansatz — seemingly pulled out of thin air — for the density function ρ(θ, ω, t) of oscillators having phase θ and intrinsic fre- quency ω at time t. Their ansatz had the form of a time-dependent Poisson density (a density better known for its role in the study of partial differential equations, specifi- cally for the solution of Laplace’s equation on a disk, given the values of the unknown function on the bounding circle). By making this ansatz of a Poisson density, Ott and Antonsen reduced the infinite-N Kuramoto model, an integro-partial differential equation, to an infinite set of coupled ordinary differential equations. Then, by further assuming that the intrinsic frequencies of the oscillators were randomly distributed according to a Lorentzian (Cauchy) distribution, Ott and Antonsen showed that the order parameter dynamics of the Kuramoto model could be reduced tremendously, all the way down to an ordinary differential equation for a single scalar variable, the amplitude of the order parameter. [96] With this discovery, the floodgates were now 6 open. Almost immediately the Ott-Antonsen ansatz was used to solve many long- standing problems about the Kuramoto model and its variants, as well as to generate and solve many new problems. [101] Still, a lot of old questions hung in the air. Both the Watanabe-Strogatz transfor- mation and the Ott-Antonsen ansatz appeared somewhat unmotivated and almost miraculous. Where did they come from, and why did they work? It also was not clear whether they were connected or perhaps even equivalent. There were reasons to doubt they were linked: the Watanabe-Strogatz transformation could be used for any finite N ≥ 3, but seemed restricted to identical oscillators, whereas the Ott-Antonsen ansatz allowed for non-identical oscillators but seemed restricted to the continuum limit of infinite N. Also, why were linear fractional transformations and Poisson den- sities — tools from other branches of mathematics — popping up in these studies of dynamical systems? Later work made sense of all of this. The Josephson arrays and the Kuramoto model both turned out to have deep mathematical ties to group theory, hyperbolic ge- ometry, and projective geometry, and both the Watanabe-Strogatz transformation and the Ott-Antonsen ansatz were tapping into these structures. [101, 100, 80, 117, 20, 21] For the Josephson arrays, the governing equations turned out to be generated by a group action, specifically the action of the Möbius group of linear fractional trans- formations of the unit disk to itself. Seen in this light, the constants of motion for the Josephson arrays were cross-ratios, and the invariant tori were group orbits. The same group-theoretic structure was found to underlie the Kuramoto model (in the special case where all the oscillator frequencies are identical) as well as other sinusoidally coupled systems of identical phase oscillators.[80, 19] In the past few years, several researchers wondered how far this story could be 7 pushed. Are there quantum or higher-dimensional extensions of the Kuramoto model that might show similar reducibility? A number of results along these lines have now been found. [72, 73, 123, 23, 24, 42, 43, 74, 52, 18, 17, 75, 28, 44, 76, 41, 53, 27] In par- ticular, several researchers have explored a generalization of the Kuramoto model in which the oscillators move on spheres instead of the unit circle; the spheres could be either the ordinary two-dimensional sphere or higher-dimensional spheres. In par- ticular, a system of particles moving on the two-sphere has been used to model the orientation dynamics of swarms of drones flying around in three dimensions. [93] As we shall demonstrate, these higher-dimensional oscillator models exhibit a dynami- cal reduction generalizing the reduction for the Kuramoto model. Consequently, the computational effort required to simulate these systems can be substantially reduced. A counterpart of the Ott-Antonsen ansatz has also been discovered for the continuum version of the Kuramoto model for identical oscillators on the d-dimensional sphere, and used to reduce its infinite-dimensional dynamics to a lower dimensional set of ordinary differential equations (ODEs). [17] But as before, some of the results appear disconnected and a bit miraculous. Our goal in this paper is to show that hyperbolic geometry and group theory can unify and clarify our understanding of the Kuramoto model on a sphere and make all the latest results seem natural, just as they did before for the traditional Kuramoto model. We focus exclusively on the case of identical oscillators, though we expect that our methods will extend to systems with multiple populations of identical oscil- lators, or a continuum of oscillator families with a distribution of natural frequencies. Our approach explains the model’s reducibility for any finite number of oscillators, as well as for the continuum limit, and it reveals why Poisson densities arise again in this setting. There is a close connection to Laplace’s equation and harmonic analysis, as we will see in Section V. We also find that complex analysis is not really essential, 8 which is just as well, since it does not generalize to the higher-dimensional spheres being considered here. Instead, the proper mathematical setting is harmonic analysis and hyperbolic geometry on higher-dimensional balls. Our work also allows us to go beyond merely unifying existing results. For instance, by establishing that linearly coupled systems of identical Kuramoto oscillators on a sphere have a hyperbolic gra- dient structure, we can prove new global stability results about convergence to the synchronized state, as described in Section VIII. 2.2 Preliminaries 2.2.1 The Kuramoto Model on a Sphere In a pioneering paper, Lohe [72] observed that there are at least two natural gen- eralizations of the Kuramoto model to higher dimensions. One of them replaces the phases θ j of the original Kuramoto model[60, 61] with complex numbers exp(iθ j) on the unit circle and then views those as equivalent to 2 × 2 rotation matrices parametrized by a rotation angle θ j. From there, it is a natural step to consider other Lie groups of matrices, many of which are non-Abelian. Our concern in this paper, however, is with a different generalization of the Ku- ramoto model. Instead of regarding oscillators as particles moving on the unit circle, we think of them as particles moving on the unit sphere. The sphere could be the sur- face of the ordinary unit ball in three dimensions, or some higher-dimensional sphere S d−1 in Rd. When d = 2, the sphere reduces to the unit circle in the plane, and the model reduces to the original Kuramoto model. The governing equations for the Kuramoto model on a sphere are ẋi = Aixi + Z − ⟨Z, xi⟩xi, i = 1, . . . ,N, (2.1) 9 where xi is a point on the unit sphere S d−1 ⊂ Rd, each Ai is an antisymmetric d×d matrix, and Z ∈ Rd is a d-dimensional vector analogous to the complex order parameter for the classic Kuramoto model. In Eq. (2.1), the matrix Ai and the vector Z are functions of the configuration (x1, . . . , xN) of points on the sphere. Note that Z does not depend on i; like the usual Kuramoto order parameter, it plays the role of a mean-field quantity that couples all the “oscillators” xi together. The antisymmetric matrix Ai is the higher- dimensional counterpart of an intrinsic frequency ωi in the original Kuramoto model. A straightforward computation shows that the dot product between an oscillator’s instantaneous position and instantaneous velocity satisfies ⟨xi, ẋi⟩ = 0, which proves that oscillators that start on the unit sphere stay on it forever. The state space for this system is the N-fold product X = (S d−1)N , which has dimension N(d − 1). Later we will also consider the natural infinite-N analogue of (2.1), where a state is a probability measure on S d−1. In what follows, we allow Z to be any smooth function on the state space X, though in examples we usually restrict to fairly simple functions, like a linear combination of the form Z = N∑ i=1 aixi where the ai are real constants. 2.3 Reduction of system parameters There is a general technique for dimensional reduction of systems which we pause to describe. Suppose we have a smooth manifold X, which we think of as a state space, and a group G acting on X, where G is also a smooth manifold (in other words, G is a Lie group). Then the group action induces a space of vector fields on X, the so-called infinitesimal generators of the action. 10 To construct these generators, let γ(t) be a smooth curve in G with γ(0) = e, the identity element in G. Then the derivative γ̇(0) = v, where v is a vector in the tangent space TeG of G at e. This vector v is in turn associated very naturally with a corre- sponding vector ṽ in the tangent space of X, as follows. For each x ∈ X, t 7→ γ(t)x is a smooth curve in X, and its derivative at t = 0 defines a vector ṽx in the tangent space at x. The vector field ṽ = (ṽx) is the infinitesimal generator corresponding to the element v in the tangent space of G at e. At each point x ∈ X, the infinitesimal generators vx span a linear subspace Vx of the tangent space TxX, which is exactly the set of vectors tangent to the group orbit Gx at the element x. Now suppose we have a vector field ξ on X, which defines a dynamical system on the state space X. If ξx ∈ Vx at each point x ∈ X, then the flow corresponding to the vector field ξ will be constrained to lie on the group orbits Gx. If the dimension of G is less than the state space X, then this will give us a dimensional reduction of the dynamics from dim X to dim G. Now suppose, as is the case in applications of this methodology, the correspondence G → Gx is one-to-one for generic x ∈ X; equivalently, the stabilizer subgroup Gx = {e} for generic x ∈ X. Then for each x ∈ X, the flow on the group orbit Gx is equivalent to a flow on G, which is a lower-dimensional space than the state space X. Here is a familiar example: Consider the orthogonal group G = S O(d) consisting of orientation-preserving linear isometries of Rd. (For an intuitive picture, think of these isometries as rotations.) Then G acts on Rd, and the corresponding infinitesimal generators are the linear vector fields νx = Ax, where A is any skew-symmetric matrix. We can also let G act on the product space X = (Rd)N of N-tuples x = (xi), xi ∈ Rd, and then the infinitesimal generators have the form (νx)i = Axi for some skew-symmetric matrix A. We could also, if we like, restrict X to N-tuples (xi) with xi ∈ S d−1, the state 11 space for (2.1). Now suppose we had a dynamical system on X of the form ẋi = Axi, where the skew-symmetric matrix A is a function of the configuration x = (xi); this is just the special case of (2.1) with Z = 0. Then the dynamics on X reduces to dynamics on G, which has dimension d(d − 1)/2, and for large N this is much smaller than the dimension of X, which is N(d − 1). Basically the configuration (xi) of points on the sphere S d−1 can only move collectively by a rotation of the sphere, so the dynamics reduces to a dynamical system on S O(d). We want to apply this methodology to the system (2.1). But since that system generally has Z , 0, we need a different group action to make this strategy work. Fortunately, vector fields of the form seen in the Kuramoto model, ẋ = Ax + Z − ⟨Z, x⟩x, (2.2) turn out to arise as the infinitesimal generators of the group action of a larger group G acting on the sphere S d−1 and its interior, the unit ball Bd. This larger group is the Möbius group of isometries of the hyperbolic geometry on Bd. It contains the orthogonal group as a proper subgroup, but has bigger dimension d(d + 1)/2. So for the Kuramoto system (2.1), if the matrices Ai all happen to be identical, we can reduce the dynamics of the system to a much smaller system on this Möbius group G, and use this reduction to understand the dynamics on the larger state space X. This is the essence of the approach we take. Ultimately we apply it to prove a synchro- nization theorem for the system (2.1) for the special order parameter Z = ∑N i=1 aixi with ai > 0. But first we need to show that vector fields on S d−1 of the form in (2.2) are indeed the infinitesimal generators of the action of the larger Möbius group. 12 2.4 Hyperbolic geometry and Möbius transformations In this paper, a Möbius transformation is a composition of Euclidean isometries and spherical inversions of Rd mapping the unit ball homeomorphically to itself and pre- serving orientation. This is a more restrictive definition than the commonly defined Möbius transformations which in general do not need to preserve the unit ball. As in the case d = 2, flows of the form (1) are intimately related to the natural hyperbolic geometry on the unit ball Bd with boundary S d−1. This geometry has metric ds = 2|dx| 1 − |x|2 , where |dx| is the ordinary Euclidean metric. Isometries are assumed to be with respect to this hyperbolic geometry, unless otherwise qualified as Euclidean. The metric ds has constant (sectional) curvature equal to −1, and we can describe its isometries, which generalize the Möbius transformations preserving the unit disc for d = 2. For d = 2, let w ∈ B2 and consider the Möbius transformation Mw(x) = x − w 1 − wx , which preserves the unit disc B2 and its boundary S 1. 2.5 Infinitesimal generators Having parametrized the Möbius transformations from a prior section, we are now ready to derive the associated infinitesimal generators of the Möbius group action on the ball Bd. We will show that they correspond to flows of the form ẏ = Ay − ⟨Z, y⟩y + 1 2 (1 + |y|2)Z, (2.3) where A is an antisymmetric d × d matrix and Z ∈ Rd is a vector. Note that, as adver- tised, this flow extends to a flow on S d−1 of the Kuramoto form in (2.2), as we can see 13 by restricting (2.3) to vectors y on the unit sphere where |y| = 1. To derive (2.3), we work separately with the boost and rotation components. Let us start with the boost component. Replace w by tw and expand Mtw(x) to first order in t: Mtw(x) ≈ x − |x|2tw 1 − 2t⟨w, x⟩ − tw ≈ x + t ( 2⟨w, x⟩x − (1 + |x|2)w ) . The derivative of this expression at t = 0 (which is just the coefficient of t) gives us the infinitesimal generator: it is an “infinitesimal boost” of the form (2.3) with Z = −2w and A = 0. Next, recall that the infinitesimal generators corresponding to the rotation components are flows of the form ẋ = Ax for an antisymmetric matrix A. Together with the infinitesimal boosts we then get all flows of the form (2.3). The group G acts on the space X in the natural way (component by component) and the infinitesimal generators of this group action on X are flows of the form (1) with all Ai identical. Therefore, by the general philosophy discussed earlier, the evolution of any initial point p ∈ X under the system (1) with all Ai = A lies in the group orbit Gp. The given Kuramoto system has N(d − 1) degrees of freedom, for some large N. However, since the flow of the system is determined via an action of the d(d + 1)/2 dimensional Lie group G, we can alternatively study the auxiliary dynamical system on G, which we call the reduced equations. By ignoring rotations, we can further restrict our attention to a system on the d-dimensional quotient G/S O(d) � Bd. The dimensional reduction not only makes the reduced equations easier to analyze than the original Kuramoto system, but the reduced equations require fewer computational resources to numerically integrate. As before, assume all the terms Ai in (1) are equal, so that Ai = A for some skew- symmetric matrix A. Fix a base point p = (p1, . . . , pN) ∈ X. Then if the points pi are in sufficiently general position, every element in the G-orbit of p can be expressed 14 uniquely as gp for some g ∈ G, with parameters w, z and ζ. We wish to derive the corresponding evolution equations for w, z and ζ. Let (xi(t)) be any solution to (1) in the group orbit Gp; we do not require that the initial point (xi(0)) = p. Then for i = 1, . . . ,N we have xi(t) = gt(pi) for a unique gt ∈ G, which determines the parameters w, z, ζ as functions of t. Now consider the equation (2.3), with coefficients A and Z evaluated at (xi(t)). This is a non-autonomous ODE on Bd, and its time-t flow must be given by some g̃t ∈ G. This ODE has solutions xi(t) = gt(pi) = gt(g−1 0 (xi(0))), which implies that g̃t = gtg−1 0 . So for any y0 ∈ Bd, y(t) = gt(g−1 0 (g0(y0))) = gt(y0) = ζMw(y0) = M−z(ζy0) must satisfy the ODE (2.3) with A and Z evaluated at (xi(t)) at time t. In particular, if we let y0 = 0, then y(t) = −ζw = z, so z satisfies the ODE (2.3). Now expand y = ζMw(y0) = M−z(ζy0) to first order in y0, using the variables z and ζ: y ≈ z + (1 − |z|2)ζy0, so ẏ ≈ ż − 2⟨ż, z⟩ζy0 + (1 − |z|2)ζ̇y0. On the other hand, (2.3) gives ẏ = Ay + 1 2 ( 1 + |y|2 ) Z − ⟨Z, y⟩y ≈ Az + 1 2 (1 + |z|2)Z − ⟨Z, z⟩z + (1 − |z|2) (Aζy0 + ⟨z, ζy0⟩Z − ⟨Z, z⟩ζy0 − ⟨Z, ζy0⟩z) . Setting y0 = 0 gives the ż equation ż = Az + 1 2 (1 + |z|2)Z − ⟨Z, z⟩z (2.4) 15 as expected, and since ⟨Az, z⟩ = 0, this in turn implies that ⟨ż, z⟩ = 1 2 (1 − |z|2)⟨Z, z⟩. Equating the y0 terms, factoring out 1 − |z|2 and canceling the common term ⟨Z, z⟩ζy0 gives ζ̇y0 = Aζx0 + ⟨z, ζy0⟩Z − ⟨Z, ζy0⟩z. Together, the last two terms above define a special type of antisymmetric operator of ζy0: Given any y1, y2 ∈ Rd, define the antisymmetric operator α as α(y1, y2)y = ⟨y1, y)y2 − ⟨y2, y)y1; this operator has range = span(y1, y2) providing y1 and y2 are linearly independent; otherwise α(y1, y2) = 0. Then for all y0 ∈ Rd, ζ̇y0 = Aζy0 + α(z,Z)ζy0 and therefore ζ̇ = (A + α(z,Z))ζ. Differentiating z = −ζw gives Az + 1 2 (1 + |z|2)Z − ⟨Z, z⟩z = −ζẇ − ζ̇w so ζẇ = (A + α(z,Z))z − Az − 1 2 (1 + |z|2)Z + ⟨Z, z⟩z = Az + |z|2Z − ⟨Z, z⟩z − Az − 1 2 (1 + |z|2)Z + ⟨Z, z⟩z = − 1 2 (1 − |z|2)Z; hence ẇ = − 1 2 (1 − |w|2)ζ−1Z. (2.5) 16 Summing up, the evolution equations for the (z, ζ) coordinate system on Gp are ż = Az + 1 2 (1 + |z|2)Z − ⟨Z, z⟩z (2.6a) ζ̇ = (A + α(z,Z))ζ, (2.6b) with A and Z evaluated at M−z(ζp), and for the (w, ζ) coordinate system on Gp are ẇ = − 1 2 (1 − |w|2)ζ−1Z (2.7a) ζ̇ = (A − α(ζw,Z))ζ, (2.7b) with A and Z evaluated at ζMw(p). Note that these equations generalize the evolution equations for the parameters w and ζ given in Chen et al.[20] for the classic case d = 2. 2.6 Comparison of Z versus W coordinates The ż equation (2.4) is an extension of the system equation (2.1) on S d−1. However, for finite N, the ż equation does not uncouple from ζ, since Z is evaluated at M−z(ζp). The exception to this is in the infinite-N limit: if the base point p is now the uniform density on S d−1, then ζp = p (the uniform density is invariant under rotations) and the density M−z(p) is a hyperbolic Poisson density on S d−1 whose centroid is a function of z. In the case d = 2, this Poisson density has centroid z. Unfortunately this simple relationship is false for d ≥ 3 (we will give more details on this in the next section). The advantage of the ẇ equation (2.5) is that for an order parameter function of the form Z = N∑ i=1 aixi, with ai ∈ R, ζ drops out of the ẇ equation and we get the reduced equation ẇ = − 1 2 (1 − |w|2)Z(Mw(p)). 17 The parameter w essentially defines the “phase relations” among the xi; two configu- rations have the same w if and only if they are related by a rotation. So w is the key parameter that determines whether the system is approaching synchrony or incoher- ence. The w variable also has a nice invariance under change of base points. Suppose p′ = M(p) ∈ Gp; then we have coordinates w′, ζ′ associated to the base point p′. Any q ∈ Gp has two expressions q = ζMw(p) = ζ′Mw′ p′ = ζ′Mw′(M(p)). Assuming the coordinates of p are in sufficiently general position, this implies ζMw = ζ′Mw′ ◦ M, and hence 0 = ζMw(w) = ζ′Mw′(M(w)). But the unique solution to Mw′(y) = 0 is w′, and hence w′ = M(w). In other words, the coordinates w and w′ transform exactly as the base points p and p′. 2.7 Continuum limit Next, we consider the dynamics of the Kuramoto model (2.1) in the limit N → ∞. We assume that the rotation terms Ai = A are constant across the population, corresponding to identical “oscillators.” Let us also assume that the order parameter Z is is proportional to the centroid of the population: Z = K N N∑ i=1 xi In the continuum limit, a state of the system is a probability measure ρ on S d−1, and the order parameter becomes Z = K ∫ S d−1 x dρ(x). 18 The measure ρ evolves according to the continuity equation (also known as the noise- less Fokker-Planck equation) associated to the flow in (1). Naturally, this flow must preserve group orbits under the action of G. Recall that if M ∈ G, then the measure M∗ρ is defined by the adjunction formula∫ S d−1 f (x) d(M∗ρ)(x) = ∫ S d−1 f (M(x)) dρ(x). In particular, we can consider the G-orbit of the uniform probability measure σ on S d−1. This orbit is special; whereas a typical group orbit Gρ has dimension equal to the dimension of G, namely d(d + 1)/2, the orbit Gσ has dimension only d. This is because the stabilizer of σ is S O(d); any rotation fixes σ, whereas the boosts deform σ. Hence the orbit Gσ has dimension d. Any element in Gσ can be written as (M−z)∗σ, with z ∈ Bd. The evolution equation for z is (2.4), with Z(z) = K ∫ S d−1 x d(M−z)∗σ(x) = K ∫ S d−1 M−z(x) dσ(x). (2.8) In the case d = 2 with x = ζ ∈ S 1, we have dσ(ζ) = 1 2πi dζ ζ , so the integral Z(z) = K 2πi ∫ S 1 ζ + z 1 + zζ · dζ ζ = K ζ + z 1 + zζ ∣∣∣∣∣∣ ζ=0 = Kz by the Cauchy integral formula. Therefore (2.4) simplifies to the equation ż = iωz + K 2 (1 − |z|2)z when d = 2. Unfortunately, the formula Z(z) = Kz is not correct for d ≥ 3; though as we shall see later, this formula is correct in higher dimensions for the complex hyperbolic model in even dimensions, which we discuss in the next section. For d = 2 the two geometries agree, which explains the coincidence for d = 2. 19 Any Riemannian manifold X has a Laplace-Beltrami operator ∆ associated to its metric; functions f on X satisfying the equation ∆ f = 0 are called harmonic. For func- tions on the ball Bd with the hyperbolic metric, this operator is ∆hyp = (1 − |x|2)2∆euc + 2(d − 2)(1 − |x|2) d∑ i=1 xi ∂ ∂xi , where ∆euc = d∑ i=1 ∂2 ∂x2 i is the standard Laplace operator (see Stoll[118], Chapter 3). We will call solutions to the equation ∆hyp f = 0 hyperbolic harmonic functions; for d = 2 these coincide with ordinary (Euclidean) harmonic functions. We can consider the hyperbolic analogue of the classical Dirichlet problem: given a continuous function f on S d−1, extend f to a hyperbolic harmonic function f̃ on Bd. Assuming this problem has a unique solution, then for any rotation ζ ∈ S O(d) we must have f̃ ◦ ζ = f̃ ◦ ζ, since rotations preserve the hyperbolic metric. If we average f ◦ ζ on S d−1 over all rotations ζ ∈ S O(d) we get the constant function fave = ∫ S d−1 f (x) dσ(x) on S d−1, and any constant is hyperbolic harmonic on Bd. Therefore the average on Bd of f̃ ◦ ζ = f̃ ◦ ζ over all ζ ∈ S O(d) must be the constant fave. But f̃ (ζ(0)) = f̃ (0) for all ζ, so we must have f̃ (0) = ∫ S d−1 f (x) dσ(x). Now let z ∈ Bd; since M−z preserves the hyperbolic metric, we must have f̃ ◦ M−z = f̃ ◦ M−z, which implies f̃ (z) = f̃ ◦ M−z(0) = ∫ S d−1 f (M−z(x)) dσ(x) = ∫ S d−1 f (x) d((M−z)∗σ)(x). 20 As shown in Chapter 5 in Stoll [118], the measure (M−z)∗σ is given by the formula d((M−z)∗σ)(x) = Phyp(z, x) dσ(x), with hyperbolic Poisson kernel function Phyp(z, x) = ( 1 − |z|2 |z − x|2 )d−1 . (2.9) Thus the solution to the hyperbolic Dirichlet problem with boundary function f on S d−1 is given by the hyperbolic Poisson integral f̃ (z) = ∫ S d−1 Phyp(z, x) f (x) dσ(x), z ∈ Bd. The orbit Gσ consists of all hyperbolic Poisson measures P(z, x) dσ(x), parametrized by z ∈ Bd. By contrast, the Euclidean Poisson kernel function is Peuc(z, x) = 1 − |z|2 |z − x|d , so the hyperbolic Poisson measures agree with the Euclidean Poisson measures only if d = 2. Now we can calculate the expression Z(z) in the general case d ≥ 2. We see from (2.8) that Z(z) is the hyperbolic Poisson integral of the function Kx on S d−1. The func- tion Kx is (Euclidean) harmonic and homogeneous of degree 1 on Rd; following the recipe in Chapter 5 in Stoll [118], we see that its extension from S d−1 to a hyperbolic harmonic function on Bd is given by Z(z) = K F(1, 1 − d/2; 1 + d/2; |z|2) F(1, 1 − d/2; 1 + d/2; 1) z, (2.10) where F is the hypergeometric function F(a, b; c; t) = ∞∑ k=0 (a)k(b)k (c)k tk k! , with (a)0 = 1 and (a)k = a(a + 1) · · · (a + k − 1) for k ≥ 1. Notice that if a or b = 0, then F(a, b; c; t) = 1; this gives Z(z) = Kz for d = 2, as expected. 21 2.8 Complex case There is an alternative generalization of Kuramoto networks to higher- dimensional oscillators when d = 2m is even. Then Rd = Cm, and we can study systems of the form ẋ j = A jx + Z − ⟨x j,Z⟩x j, i = 1, . . . ,N, (2.11) where now xi is a point on the unit sphere S 2m−1 ⊂ Cm, Ai is an anti-Hermitian m × m complex matrix, Z ∈ Cm and ⟨, ⟩ denotes the complex-valued Hermitian inner product. These systems are the same as the real case when d = 2,m = 1 but are different for m ≥ 2. To see this, suppose Ax + Y − ⟨x,Y⟩R x = Bx + Z − ⟨x,Z⟩C x for all x ∈ S 2m−1 ⊂ Cm = Rd, where A is antisymmetric, B is anti-Hermitian, Y,Z ∈ Cm and we use the subscripts R and C to distinguish the real and complex inner products. Then (A − B)x = Z − Y + ( ⟨x,Y⟩R − ⟨x,Z⟩C ) x and so (A − B)(−x) = (A − B)x for all x ∈ S 2m−1, which implies A = B. This implies Y − Z = ( ⟨x,Y⟩R − ⟨x,Z⟩C ) x for all x ∈ S 2m−1, hence Y − Z ∈ spanC(x) for all x ∈ Cm; if m ≥ 2, this implies Y = Z. But then we have ⟨x,Y⟩R = ⟨x,Y⟩C for all x ∈ Cm, which can only hold if Y = 0. Hence for m ≥ 2, the only flows simulta- neously of the form (1) and (2.11) have Z = 0 and A anti-Hermitian. Flows of the form (2.11) are related to the complex hyperbolic geometry on the complex unit ball Bm with the Bergman metric (see Rudin[110], Chapter 1). The 22 orientation-preserving isometries of this metric are generated by unitary transforma- tions ζ ∈ U(m) and boost transformations of the form Mw(x) = √ 1 − |w|2 x + ( ⟨x,w⟩ 1+ √ 1−|w|2 − 1 ) w 1 − ⟨x,w⟩ = x − w + ⟨x,w⟩w−|w| 2 x 1+ √ 1−|w|2 1 − ⟨x,w⟩ . Notice that when m = 1, this reduces to the standard complex Möbius map Mw. As in the real case M0 is the identity, M−1 w = M−w, Mw(w) = 0 and Mw(0) = −w. Any orientation-preserving isometry of Bd can be expressed uniquely in the form g(x) = ζMw(x) = M−z(ξx), where w, z ∈ Bm but now ζ, ξ ∈ U(m), the complex unitary group. Linearizing at x = 0 gives g(x) ≈ ζ −w − ⟨x,w⟩w + √ 1 − |w|2 x + ⟨x,w⟩w 1 + √ 1 − |w|2  ≈ ζ −w + √ 1 − |w|2 x − √ 1 − |w|2⟨x,w⟩w 1 + √ 1 − |w|2  ≈ z + √ 1 − |z|2 ξx − √ 1 − |z|2⟨ξx, z⟩z 1 + √ 1 − |z|2 which implies z = −ζw (hence |z| = |w|) and ξ = ζ, as before. The corresponding infinitesimal transformations are given by flows on the complex unit ball Bm of the form ẏ = Ay + Z − ⟨y,Z⟩y, (2.12) with A anti-Hermitian m × m and Z ∈ Cm. This flow extends to a flow on S 2m−1 of the form in (2.11). Note the absence of the quadratic term |y|2Z here. To derive these infinitesimal transformations, we can apply our prior power series expansion, noting that |x| = 1 to obtain Mtw(x) ≈ x − tw 1 − t⟨x,w⟩ ≈ x + 2 (⟨x,w⟩x − w) t. 23 Figure 2.1: A first-order linear Kuramoto system on the two-dimensional sphere S 2 with equal weights ai = 1/N, and randomly chosen initial conditions. The states shown are at t = 0, t = 10, and t = 40 respectively. This simulation was written in Python and visualized with Plotly. So the infinitesimal generator is an “infinitesimal boost” of the form (2.12) with Z = −1 2w and A = 0. The infinitesimal generators corresponding to the rotation components are flows of the form ẋ = Ax with A anti-Hermitian; together with the infinitesimal boosts we get all flows of the form (2.12). We mention in passing that the complex model (2.11) has a natural quantum net- work analogue, studied by Lohe [73], in which the complex vector xi is replaced by a normalized wave function |ψi⟩. We expect that our reduction techniques extend to this infinite-dimensional quantum model. 2.9 Relation to previous research Many of the results above can be found in some form in the work of earlier au- thors. [72, 123, 23, 43, 74, 52, 18, 17, 75, 41, 53] Three papers in particular overlap considerably with the present work. Tanaka [123] demonstrates in his 2014 paper that the dynamics of (2.1) can be re- duced using Möbius transformations that fix the unit ball, similar to what Marvel, Mirollo, and Strogatz found[80] for the traditional Kuramoto model. Tanaka writes his Möbius transformations differently from ours, but he uses the same group of trans- 24 formations and he also gets reduced equations for his Möbius parameters. Tanaka’s equation (10b) looks similar to our ż equation (2.4), except without the |z|2 term, which is puzzling. He does not mention the reduction down to dimension d in the finite-N case that we get with the ẇ equation (2.5). Tanaka also notes that the complex case when d = 2m is different, and generalizes the Ott-Antonsen residue calculation to this case, which is the highlight of his paper. In the real case, Tanaka’s equation (15) is sim- ilar to our equation (2.10), though we were not able to show that the two expressions are equivalent. Finally, Tanaka also presents a generalization of the Ott-Antonsen reduction[96] for the complex version of the system. Lohe [74] also looks at the same system as (1) (see his equation (22)) and he derives a similar reduction as ours by using Möbius transformations for the finite-N model. His transformation (30) on S d−1 is our Mw (with v = w) and his equation (31) is the same as our ż equation (2.4). He also has something that looks like the ẇ equation (2.5), which he says is independent of the rest of the reduced system for (in our notation) an order parameter function of the form Z = 1 N N∑ i=1 λiQixi, where Qi ∈ O(d) and λi ∈ R. But such a Z does not satisfy the identity ζZ(p) = Z(ζp) for all rotations ζ, unless Qi = ±I, so we do not see how the ζ term cancels. Lohe’s map M in his equation (55) (ignoring the R factor) agrees with our map M−v on the sphere S d−1, but not on the ball Bd. So it is not a Möbius transformation of the type we are using. For example, M(−v) = v whereas M−v(−v) = 0. We are not sure why Lohe [74] prefers these maps over the boosts; he claims that M preserves cross- ratios, but we do not see why this is advantageous. His map F in equation (63) (again ignoring the R factor) is exactly our M−v. Chandra, Girvan, and Ott [17] concentrate on the infinite-N or continuum limit 25 Figure 2.2: A first-order linear Kuramoto system on S 2 with weights distributed ac- cording to a Riemann sum which approximates the integral of a normal distribution, and randomly chosen initial conditions. Pink particles contribute to the order parame- ter with greater weights than the blue particles do. The states shown are at t = 0, t = 10, and t = 40 respectively. system, and derive a dynamical reduction for a special class of probability densities on S d−1, generalizing the Poisson densities used in the Ott-Antonsen reduction. They proceed directly to the infinite-N version of (1). They make a very clever guess (their equation (7)) of the form of the special densities that generalize the Poisson densities for d = 2, and then calculate the exponent in the denominator of their expression, getting exactly the hyperbolic Poisson kernel densities in (2.9) above. Their equation (15) is exactly the same as our ż equation (2.4) in the infinite-N limit. The integral in their equation (19) can be evaluated, as shown above in (2.10). 2.10 An example: First-order linear order parameter gives gradient system We conclude with an analysis of the system (1) with a weighted order parameter Z = N∑ i=1 aixi, (2.13) where the ai are real constants. 26 Figure 2.3: A first-order linear Kuramoto system on S 2 with a majority cluster, where one particle is chosen to have a weight which exceeds the combined weights of all other particles (or equivalently, where all the particles have equal weight but a ma- jority of them cluster into a single point and therefore act if they were a single giant particle; hence the name “majority cluster”). The states shown are at t = 0, t = −10, and t = −40 respectively; we have chosen to depict time running backward to high- light that the backwards-time limit tends toward an antipodal configuration. In this simulation, one particle, depicted in pink, was chosen to have a weight of 0.6, and the remaining 99 particles, depicted as blue, were chosen to have equal weights of 0.4/99. 2.10.1 Computer visualization We have been discussing a system of particles on a sphere that coalesce (in other words, they spontaneously synchronize) when certain conditions are met. To visualize this coalescence, we implemented the Runge-Kutta algorithm to numerically solve the Kuramoto model on the 2-dimensional sphere in 3-space, corresponding to the case d = 3 in (2.1). For simplicity, we simulated N = 100 particles with equal weights (ai = 1/N in the order parameter Z) and set the rotation term A to zero for all the particles, a choice that is tantamount to ignoring the rotational influence, or equivalently, rotating the frame of reference along with the entire system as it evolves. In the simulation shown in Fig. 2.2, randomly chosen points on the sphere were used as initial conditions. As time increases, one can see that the particles coalesce to a limit point, mimicking the spontaneous synchronization that is well known for the traditional Kuramoto model (d = 2) when the oscillators are identical. Later in this section, we will prove this synchronization behavior holds more gen- 27 erally for Kuramoto models on the sphere having weighted order parameters of the form (2.13), provided the weights ai are all positive and sum to 1 and no individual weight exceeds 1/2. A partial result in this direction was obtained by Choi et al. [24] These authors prove that in the case where all ai are equal and positive, initial con- ditions satisfying ⟨xi(0),Z(0)⟩ > 0 for all i will synchronize. Geometrically, this condi- tion is equivalent to requiring that the oscillators all lie on the hemisphere given by ⟨xi, y⟩ > 0 for some vector y , 0. More generally, similar partial synchronization results are obtained by Ha et al. [41] for the case Z = (aI +W) N∑ i=1 xi, with W skew-symmetric, and by Ha and Park [44] for the complex system (2.11) with Z = ∑ xi. Figure 2.2 shows a simulation in which we weighted each particle according to the terms in a Riemann sum approximating the integral of the normal probability distri- bution. The pink particles, which have higher weights, exert greater influence over the final synchronization location of the particles, but there is still synchronization. When time runs backwards, almost all initial conditions of the particles tend to- wards a limiting configuration where their centroid is at the origin. The exception is when we have a majority cluster, depicted in Fig. 2.3, where one particle has a weight which exceeds the weight of all other particles. When this condition holds, it is impossible to arrange the particles so their weighted centroid is at the origin, so the backwards time limit will tend towards an antipodal configuration, where all parti- cles not in the majority cluster will coalesce around the antipode of the cluster. This is the configuration which minimizes the magnitude of the weighted centroid. We do not include the proof that the backwards-time limit is antipodal in this case, but it is a straightforward generalization of the result which we do prove below. 28 2.10.2 Existence of hyperbolic gradient As mentioned above, if Z has the form in (2.13), then the ẇ equation in (2.5) reduces to ẇ = − 1 2 (1 − |w|2)Z(Mw(p)), (2.14) independent of the parameter ζ. We will show that this is a gradient flow on the unit ball Bd with respect to the hyperbolic metric. In the presence of a Riemannian metric we can associate a 1-form to any vector field, and the vector field is gradient if and only if the associated 1-form is exact; since the unit ball is simply connected, this holds if and only if the associated 1-form is closed. For the Euclidean metric on Bd (or any open subset of Rd) and standard coordinates w1, . . . ,wd, the 1-form associated to the vector field with components f1, . . . , fd is ω = f1 dw1 + · · · + fd dwd. If we scale the Euclidean metric by a positive smooth function φ, then the associated 1-form with respect to the metric ds = φ|dw| is then ω = φ2( f1 dw1 + · · · + fd dwd). Therefore the gradient of a function Φwith respect to this scaled metric is given by ∇Φ = φ−2∇eucΦ, where ∇euc denotes the ordinary Euclidean gradient operator. We have φ(w) = 2(1 − |w|2)−1 for the hyperbolic metric, so the hyperbolic gradient operator on Bd is given by ∇hypΦ(w) = 1 4 (1 − |w|2)2∇eucΦ(w). Now let’s consider the vector field V defined by (2.14). By linearity, it suffices to treat the case Z = xi, and we can take i = 1 without loss of generality. Then the associated 1-form is 29 ω = 4 (1 − |w|2)2 ( − 1 2 (1 − |w|2) ) · d∑ j=1 ( (1 − |w|2)(p1, j − w j) |p1 − w|2 − w j ) dw j = −2 d∑ j=1 ( p1, j − w j |p1 − w|2 − w j 1 − |w|2 ) dw j where p1, j denotes the jth component of the point p1 ∈ S d−1. Let E j denote the coeffi- cient of dw j in parentheses above; then dω = −2 d∑ j,k=1 ∂E j ∂wk dwk ∧ dw j. Applying the chain and quotient rules gives ∂E j ∂wk = 2(p1, j − w j)(p1, k − wk) |p1 − w|4 + 2w jwk (1 − |w|2)2 for j , k, which is symmetric in j and k; hence the sum above for dω simplifies to dω = 0. Thus ω is closed and we see that the flow (2.14) is gradient for any order parameter function of the form (2.13). Next, we show that the hyperbolic potential for V , up to an additive constant, is given by Φ(w) = N∑ i=1 ai log 1 − |w|2 |w − pi| 2 = 1 d − 1 N∑ i=1 ai log Phyp(w, pi). (2.15) Here we follow the convention that the potential decreases along trajectories, so we are asserting that ∇hypΦ = −V . To derive this, use the identity ∇euc|w − w0| 2 = 2(w − w0), for 30 any constant vector w0 ∈ Rd. Then ∇eucΦ(w) = N∑ i=1 ai ( − 2w 1 − |w|2 − 2(w − pi) |w − pi| 2 ) = 2 1 − |w|2 N∑ i=1 ai ( (1 − |w|2)(pi − w) |w − pi| 2 − w ) = 2 1 − |w|2 N∑ i=1 aiMw(pi) = 2 1 − |w|2 Z(Mw(p)). Hence we see that ∇hypΦ(w) = 1 2 (1 − |w|2)Z(Mw(p)) = −V(w), as desired. 2.10.3 Analysis of dynamics We can use the existence of the potential Φ(w) for the flow on Bd to prove a global synchrony result for the system (2.1) when the coefficients ai in the order parameter Z are all positive. Specifically, we assume that 0 < ai < 1/2 for all i, and ∑N i=1 ai = 1. We also assume N ≥ 3 and all the rotation terms Ai in (2.1) are equal. Under these condi- tions, almost all trajectories for (1) converge in forward time to the (d−1)-dimensional diagonal manifold ∆ ⊂ X as t → ∞, meaning that the system self-synchronizes. In contrast, in backwards time the system tends to an incoherent state having zero order parameter: as t → −∞ almost all trajectories for (1) converge to the codimension-d subspace Σ ⊂ X consisting of states with Z(p) = 0. The proof is modeled after Theorem 1 in Chen et al. [21] and will be based on two preliminary lemmas. In each of these lemmas we assume the conditions on the ai above, and that the base point p = (pi) for the flow (2.14) has all distinct coordinates. We begin with a general observation about gradient flows in the ball Bd: if w0 ∈ Bd is any initial condition and w∗ ∈ Bd is in the forward limit set Ω+(w0), then w∗ is a fixed 31 point for the flow. To see this, let Φ be a potential for the flow, and suppose w(tn) → w∗ ∈ Bd for some sequence tn → ∞. Since the potential decreases along trajectories, lim t→∞ Φ(w(t)) = lim n→∞ Φ(w(tn)) = Φ(w∗). Let Ft denote the time-t flow map. If w∗ is not a fixed point, then for any s > 0, lim t→∞ Φ(w(t)) = lim n→∞ Φ(w(tn + s)) = lim n→∞ Φ(Fs(w(tn))) = Φ(Fs(w∗)) < Φ(w∗), which is a contradiction, so w∗ must be a fixed point. (Compact limit sets are con- nected, so Ω+(w0) cannot consist of two or more but finitely many fixed points; how- ever it is possible that forward or backward limits sets for gradient flows consist of a continuum of fixed points. We will see that this is not the case for our system on Bd.) Lemma 2.10.1. Any fixed point for the flow (2.14) in Bd is repelling. Proof. Suppose w∗ ∈ Bd is a fixed point for (2.14). As discussed above, an advantage of using the w-parameter is the equivariance with respect to change of base point p. Consequently we can assume w∗ = 0 without loss of generality, so Z(p) = N∑ i=1 ai pi = 0. To first order in w, Mw(pi) = pi − w 1 − 2⟨w, pi⟩ − w = (pi − w) ( 1 + 2⟨w, pi⟩ ) − w = pi − 2w + 2⟨w, pi⟩pi. 32 The linearization of (2.14) at the fixed point w∗ = 0 is ẇ = − 1 2 N∑ i=1 ai ( pi − 2w + 2⟨w, pi⟩pi ) = w − N∑ i=1 ai⟨w, pi⟩pi. We claim that the linear map Tw = N∑ i=1 ai⟨w, pi⟩pi has ||T || < 1; to see this, suppose |w| = 1. Then |⟨w, pi⟩pi| ≤ 1 and Tw is a convex combination of the vectors ⟨w, pi⟩pi. We can only obtain |Tw| = 1 if all terms ⟨w, pi⟩pi = u with |u| = 1, which implies all pi = ±u, and this cannot happen if at least three of the pi are distinct. Hence ||T || < 1 and so the eigenvalues µi of T satisfy |µi| < 1. The eigenvalues for the ẇ linearization are λi = 1 − µi, so wee see that Re λi > 0 for all i, establishing that the fixed point w∗ is repelling. □ Lemma 2.10.2. lim |w|→1 Φ(w) = −∞. Proof. It suffices to show that lim n→∞ Φ(wn) = −∞ for any sequence wn ∈ Bd with wn → x ∈ S d−1. The result is clear if x , pi: as n → ∞ the terms |wn − pi| in the potential (3.1) are bounded away from 0, and 1− |wn| 2 → 0. So let’s say that wn → p1. We rewrite Φ(wn) as 33 Φ(wn) = log(1 − |wn| 2) − 2a1 log |wn − p1| − 2 N∑ i=2 ai log |wn − pi| = log(1 − |wn|) − 2a1 log |wn − p1| + log(1 + |wn|) − 2 N∑ i=2 ai log |wn − pi|. The latter two terms above have finite limit as n → ∞, so we focus on the first two terms. We have 1 − |wn| ≤ |wn − p1|, so log(1 − |wn|) − 2a1 log |wn − p1| ≤ (1 − 2a1) log |wn − p1| → −∞ as n → ∞, which proves our result. Notice that we need the assumption ai < 1/2 for this argument. □ Theorem 2.10.3. Under the conditions above, almost all trajectories for (2.1) converge to ∆ as t → ∞ and to Σ as t → −∞. Proof. Let p = (p1, . . . , pN) ∈ X be any point with all distinct coordinates. The points on Gp are parametrized by w ∈ Bd and ζ ∈ S O(d), and the dynamics for these pa- rameters are given by (2.5). We begin with the dynamics as t → −∞. Let w(t) be a trajectory for (2.14) with initial condition w0 ∈ Bd, and consider the backward time limit set Ω−(w0); this is a nonempty, compact, connected subset of Bd. The potential Φ is decreasing along all trajectories w(t), hence bounded below as t → −∞, so Lemma 2 implies that the limit set Ω−(w0) must be contained in the interior Bd. We know that any w∗ ∈ Ω−(w0) is a fixed point for the flow. By Lemma 1, w∗ is repelling and so any trajectory w(t) which comes sufficiently close to w∗ must have w(t) → w∗ as t → −∞; therefore Ω−(w0) = {w∗}. This proves the existence of fixed points for (2.14), and that every trajectory w(t) converges to a fixed point as t → −∞. If the flow had multiple 34 fixed points, we would obtain a partition of Bd into the disjoint open basins of repul- sion of the fixed points, violating connectedness of the ball. Therefore (2.14) has a unique fixed point w∗, and w(t) → w∗ as t → −∞ for all trajectories. The fixed point w∗ has Z(Mw∗(p)) = 0, so all trajectories in Gp converge to Σ as t → −∞. In forward time, the limit set Ω+(w0) for any w0 , w∗ must be completely contained in the boundary S d−1, since the unique fixed point w∗ ∈ Bd is repelling. Suppose we remove the factor (1/2)(1 − |w|2) in the flow (2.14); the scaled vector field on Bd given by ẇ = − N∑ i=1 aiMw(pi) = w − N∑ i=1 ai ( (1 − |w|2)(pi − w) |pi − w|2 ) (2.16) has the same trajectories as the original flow, just with different time parametrizations. Observe that this scaled vector field extends smoothly to Rd − {pi}, and coincides with the radial vector field x at any x ∈ S d−1 with x , pi. Therefore there is a unique trajectory passing through each point x ∈ S d−1, flowing from the interior to the exterior of the sphere, as long as x , pi. Consequently the original flow (2.14) has a unique trajectory w(t) in Bd with w(t) → x as t → ∞, as long as x , pi. This also shows that there is a neighborhood U of S d−1 − {pi} such that if w(t0) ∈ U for some t0, then w(t) → x , pi for some x ∈ S d−1. So if Ω+(w0) contains some x , pi, then the trajectory w(t) of w0 must enter the neighborhood U, and therefore w(t)→ x ∈ S d−1 as t → ∞. Since limit sets are connected, the only other possibility is Ω+(w0) = {pi} for some i; equivalently, w(t) → pi. We will show that there is a unique trajectory with this behavior for each pi. Assuming this, we see that with N + 1 exceptions, any trajectory w(t) converges to a point x ∈ S d−1 with x , pi (the exceptions are the N trajectories converging to the base point coordinates pi, and the fixed point trajectory w∗). The corresponding trajectory in Gp has coordinates ζ(t)Mw(t)(pi) = ζ(t) ( (1 − |w(t)|2)(pi − w(t)) |pi − w(t)|2 − w(t) ) . 35 We have |w(t)| → 1 and |pi − w(t)| is bounded away from 0 as t → ∞, so Mw(t)(pi) → −x for each i and therefore the trajectory ζ(t)Mw(t)(p) in Gp converges to ∆ as t → ∞. This analysis breaks down at x = pi because the scaled vector field above does not have a unique limit as w → pi; rather, its limit depends on the direction of the approach. To see this, write w = p1 − ru, where 0 < r < 1 and |u| = 1 (with this convention, u = p1 corresponds to w approaching p1 radially). Then |p1 − w| = r and |w|2 = 1 − 2r⟨p1, u⟩ + r2 so (1 − |w|2)(p1 − w) |p1 − w|2 = (2r⟨p1, u⟩ − r2)ru r2 = (2⟨p1, u⟩ − r) u. As r → 0, the magnitude of this term is 2⟨p1, u⟩, which depends on the angle of ap- proach given by u (note that ⟨p1, u⟩ > 0 because u points outwards at p1). To complete the proof, we will examine the scaled system (2.16) using the polar representation (r, u), and show that the polar system has the unique fixed point r∗ = 0, u∗ = p1, which has a unique attracting trajectory because it is a saddle with a (d − 1)- dimensional unstable manifold. We see that the scaled system has ẇ = p1 − ru − a1 (2⟨p1, u⟩ − r) u + O(r) = p1 − 2a1⟨p1, u⟩u + O(r), where the O(r) term is a smooth function of r and u for |r| < ε = min |pi − p1|, i ≥ 2. This condition insures that |pi − w| ≥ |pi − p1| − |r| > 0, so the i ≥ 2 terms in the scaled ẇ equation are all smooth functions of r and u. And we can allow r < 0 here, even though it is not relevant to the ẇ system. Now r2 = |w − p1| 2, so rṙ = ⟨w − p1, ẇ⟩ = −r⟨u, ẇ⟩, 36 which gives ṙ = −(1 − 2a1)⟨p1, u⟩ + O(r). Differentiating ru = p1 − w gives ru̇ = −ṙu − ẇ = (1 − 2a1)⟨p1, u⟩u − ( p1 − 2a1⟨p1, u⟩u ) + O(r) = ⟨p1, u⟩u − p1 + O(r). Hence the scaled system in polar form can be written rṙ = −(1 − 2a1)r ⟨p1, u⟩ + O(r2), ru̇ = ⟨p1, u⟩ u − p1 + O(r). We emphasize that the O(r) and O(r2) terms are smooth functions of r, u as long as |r| < ε. We consider the “semi-scaled” polar system ṙ = −(1 − 2a1)r ⟨p1, u⟩ + O(r2), (2.17a) u̇ = ⟨p1, u⟩ u − p1 + O(r), (2.17b) which has the same trajectories as the original system, just with different time parametrizations. The advantage of this modified system is that the equations are smooth on (−ε, ε) × S d−1. Observe that the system (2.17) has {0} × S d−1 invariant, and has fixed point (r∗, u∗) = (0, p1). The fixed point p1 is repelling on the invariant manifold {0} × S d−1; to see this, observe that ⟨p1, u⟩˙= ⟨p1, u⟩2 − 1 when r = 0. In fact, if we assign the coordinate θ on any great circle joining p1 and −p1 on S d−1 so that u = eiθ and p1 = 1, then the system reduces to θ̇ = sin θ. We also 37 see that the ṙ equation linearized at (0, p1) is ṙ = −(1 − 2a1)r, so the linearization of (2.17) has the single negative eigenvalue −(1 − 2a1) and d − 1 positive eigenvalues +1. Therefore (0, p1) is a saddle with a one-dimensional stable manifold, and hence has a unique trajectory (r(t), u(t))→ (0, p1) with r(t) > 0. Now suppose we have a trajectory w(t) → p1 in our original system (2.14). The corresponding trajectory for (2.17) will have r(t) → 0; we cannot achieve r(t) = 0 in finite time because the manifold {r = 0} is invariant for (2.17). We must prove that u(t)→ p1, so that this trajectory is in fact the saddle stable manifold. Observe that ⟨p1, u⟩˙= ⟨p1, u⟩2 − 1 + O(r). Also note that ⟨p1, u(t)⟩ > 0 since |w(t)| < 1. Let 0 < c < 1; then for some T ≥ 0, t ≥ T implies O(r(t)) ≤ (1 − c2)/2. Now suppose 0 < ⟨p1, u(t0)⟩ < c for some t0 ≥ T ; then 0 < ⟨p1, u(t)⟩c for all t ≥ t0. This is because the function t 7→ ⟨p1, u(t)⟩ is decreasing if 0 < ⟨p1, u(t)⟩ < c: ⟨p1, u(t)⟩˙≤ c2 − 1 + 1 2 (1 − c2) = − 1 2 (1 − c2) as long as 0 < ⟨p1, u(t)⟩ < c. But this also implies that eventually ⟨p1, u(t)⟩ < 0, which is a contradiction. Hence we must have ⟨p1, u(t)⟩ ≥ c for all t ≥ T , which proves that u(t)→ p1. □ 38 Part II Electrostatic knot theory 39 CHAPTER 3 A LOWER BOUND ON THE CRITICAL POINTS OF THE ELECTRIC POTENTIAL OF A KNOT This chapter originally appeared as ”A lower bound on critical points of the electric potential of a knot” in Journal of Knot Theory and its Ramifications, Vol. 30. [66] 3.1 Preliminaries and problem formulation Our novel problem of interest is to analyze the zeros of the electric field around a charged, knotted wire fixed in place. Let K ⊆ R3 ⊆ S 3 be a smooth knot parametrized by the curve r(t), t ∈ [0, 2π] with r(0) = r(2π). We will take the convention that S 3 is the union of R3 and a single compactifying point at infinity. Suppose K is endowed with a uniform charge distribution. With a choice of units, the electric potential be- tween a point k ∈ K and a point charge at x at a distance R from k is proportional to R−1. It therefore makes sense to define the electric potential Φ : S 3 − K → R, on the complement of K by the line integral Φ(x) = ∫ k∈K dk |x − k| = ∫ 2π 0 |r′(t)| |x − r(t)| dt, x ∈ R3 − K. (3.1) We set Φ(∞) = 0 to ensure smoothness. By differentiating under the integral sign with respect to x, we can see the electric potential is smooth and harmonic. The elec- tric field is defined by E = −∇Φ. We want to describe the critical points of the potential (equivalently, the zeros of the electric field) and their behaviors. These represent equi- librium points where a charged particle at rest will continue to experience no electric force from the charge distribution. Some conventions use the negative of the poten- tial, so that the electric field points towards the knot, but it is more convenient for our purposes to work with a nonnegative potential function. 40 Define the knot invariant cp(K) to be the smallest number of critical points of the electric potential among all parametrizations in the knot isotopy class [K]. All parametrizations have a critical point at infinity, which we include in the count. We will now assume r is a parametrization which yields a critical set of minimal size. We obtain a lower bound for the number of critical points of the electric potential based on a well known topological invariant called the tunnel number t(K). The tunnel number was originally introduced by Clark [25], and remains an active topic of knot theory research [5]. We now come to the main result of this article. Proving this theorem uses Morse theory and stable manifold theory. Theorem 3.1.1. For all knots K, cp(K) ≥ 2t(K) + 2. 3.2 Preliminary definitions and lemmas The theorems of Morse theory require us to work on a compact manifold, so in the sequel we will define the knot complement of K in S 3 to be the complement of an open tubular neighborhood of K of sufficiently small radius. The theorems from Morse theory we invoke will still hold on this compact manifold with boundary because the potential function is proper, with gradient transversely intersecting the boundary. We will still denote our domain by S 3 − K. Many of the following definitions and results are standard and can be found in Nicolaescu [88] and Burde [15]. A critical point of a smooth real valued function f on a manifold M is a point p such that the differential d f (p) is zero. The critical set of f is the set of critical points, and is denoted Crit( f ). We say f is Morse if its critical points are nondegenerate, which means the Hessian matrix H( f ) of second partial derivatives is nonsingular. The index of p ∈ Crit( f ) is the number of negative eigenvalues of H( f ), 41 which is invariant under the choice of local coordinates. We denote the set of critical points of index i by Criti( f ). If f is fixed, we denote the index of p by λ(p) and we denote the number of critical points of index i by mi. In the space of all smooth real valued maps, under a suitable function space topology, the set of Morse functions is dense. Therefore, we may assume the electric potential Φ is Morse by adding a perturbation if necessary. We write WS (p) and WU(p) to denote the stable and unstable manifolds of p ∈ Crit( f ) respectively. Recall that the stable manifold (resp. unstable manifold) of p is the set of all points which flow to p along the gradient vector field ∇ f as time tends to infinity (resp. negative infinity). If f is Morse, the dimension of WS (p) is λ(p) and the dimension of WU(p) is dim M − λ(p). The Morse Reconstruction Theorem states the domain of a Morse function on a compact manifold can be expressed as a cell complex by attaching closed discs with dimensions given by the indices of the critical points. The attaching maps are obtained by a process known as surgery, but we will not need to discuss the attaching maps in any further detail. The Morse inequalities state for a fixed Morse function f on a manifold M, mi ≥ bi, where bi = dim Hi(M) is the ith Betti number of our domain manifold. We will need the stronger result which states dim M∑ i=0 (−1)imi = dim M∑ i=0 (−1)ibi = χ(M), (3.2) with χ(M) denoting the Euler characteristic of M. See Nicolaescu, Corollary 2.3.3 [88]. Before turning to the proof of Theorem 3.1.1, we need to prove a few preliminary lemmas. 42 Lemma 3.2.1. For all knots K, Hi(S 3 − K) = Z, for i = 0, 1 and Hi(S 3 − K) = 0 for i ≥ 2. Proof. See Rolfsen, Proposition 3.A.3 [109]. □ From the homology of the knot complement, along with (3.2), we can deduce the following lemma. Lemma 3.2.2. For all knots K, and with Φ defined above, m1 − m2 = 1. Proof. Equation (3.2) states the Euler characteristic equals the alternating sum of the mi’s. In other words, χ(S 3 − K) = 3∑ i=0 (−1)imi. From Lemma 3.2.2, we can see that χ(S 3 − K) = 1 − 1 = 0. Since Φ is harmonic, every critical point has index 1 or 2, save for the point at infinity, which has index 0. We can conclude m0 − m1 + m2 − m3 = 0, or equivalently, m1 − m2 = 1 as desired. □ Remark 3.2.3. As m2 ≥ 0, m1 = 1 + m2 ≥ 1. That is, there is always a critical point of index 1. The next set of definitions and results are standard in 3-manifold topology, and fur- ther exposition can be found in [113] and [111]. A handlebody is a topological space homotopic to the three dimensional ball with solid handles attached (by “attaching handles” we mean there are copies of D2 × [0, 1] where the boundary discs D2 × 0 and D2 × 1 are embedded on the boundary of the three-ball). Given a knot K ⊆ S 3, the tunnel number t(K) is the least number of arcs we must add to K such that the com- plement in S 3 is a handlebody. A collection of arcs with this property is known as a tunneling. In the proof of Theorem 3.1.1, we shall use the above definition of the tunnel num- ber, but there is an equivalent definition that is more visually intuitive. A Heegaard 43 splitting of a three-manifold M is an embedding of a closed, compact, and orientable surface H such that the interior and exterior of H in M are both handlebodies. We say the genus of H is the genus of the splitting. Theorem 3.2.4. Let H be an unknotted embedding of a genus g surface in S 3. That is, let H be the boundary of a tubular neighborhood around a wedge sum of g unknotted circles. Then H defines a Heegaard splitting. Theorem 3.2.5. Genus g Heegaard splittings of S 3 are unique up to isotopy. The previous result is known as Waldhausen’s Theorem. Clearly, a tunneling of a knot defines a Heegaard splitting. Therefore, we can view the tunnel number of K as the least number of arcs we must add to K such that it is isotopic to a wedge sum of unknotted circles. We can immediately deduce that tunnelings and the tunnel number always exist. Lemma 3.2.6. Every smooth knot K has a tunnel number. Proof. Take a diagram of K with finitely many crossings. Over each crossing, introduce an arc connecting the top and bottom strands. Collapse each arc so that the top and bottom strands intersect, and project onto the diagram’s plane so that we are left with a wedge sum of say, g circles. By Theorem 3.2.4, it follows that the complement in S 3 is also a handlebody with g handles. □ Remark 3.2.7. We just proved the tunnel number is bounded above by the crossing number, the least number of crossings needed in a knot diagram of K. As some elementary examples, the tunnel number of the unknot is zero, and the tunnel number of the trefoil is one. Indeed, the tunnel number need not be the crossing 44 number. For example, torus knots have tunnel number 1, yet can have arbitrarily high crossing number. See Clark [25]. 3.3 Proof of the tunnel number bound We now come to the proof of our main result. To prove the result, we construct a tunneling with m2 arcs. This proves cp(K) ≥ m2 ≥ t(K). Then, by applying Lemma 3.2.2, we get that cp(K) = m0 + m1 + m2 + m3 ≥ 1 + (t(K) + 1) + t(K) + 0 = 2t(K) + 2 as desired. 3.3.1 Construction of the tunneling We will apply the Morse Rearrangement Lemma to allow us to make some ad- ditional convenient assumptions about Φ without losing generality. A proof can be found in Nicolaescu, Chapter 2.4 [88]. The theorem states we can find a smooth Φ̂ : S 3 − K → R satisfying the following properties: • Φ and Φ̂ share the same critical points, and each critical point has the same index. • Suppose p and q are distinct critical points. Then Φ̂(p) , Φ̂(q). If Φ̂(p) < Φ̂(q), then λ(p) ≤ λ(q). • Inside of a neighborhood of each critical point, the gradient flows for Φ and Φ̂ are identical. • If γ is an integral curve of ∇Φ̂, then Φ(γ(t)) is strictly increasing in t. A vector field with this property is called gradient-like with respect to Φ. • For p, q ∈ Crit(Φ̂), WS (p) and WU(q) intersect transversely. Morse functions with this property are called Morse-Smale. 45 This theorem allows us to perturb the values of the critical points so we can reorder them ascending by index without affecting the topological data encoded by the origi- nal potential. At this point, we are not necessarily working with the physical potential whose formula is given in (3.1), but for simplicity we will still refer to the perturbation as Φ. Our rearrangement restricts the limiting behavior of trajectories. Let γ : (−∞,∞)→ S 3 − K be a trajectory for Φ. When t → −∞, Φ(γ(t)) strictly decreases, but it is bounded below by 0, by assumption. Should lim t→−∞ Φ(γ(t)) = 0, then lim t→−∞ γ(t) = ∞, the point of infinity on S 3, because it is the only point in the knot complement with zero potential. Otherwise, lim t→−∞ γ(t) is a critical point. Similarly, we can deduce that for t → ∞, either γ(t) tends to a critical point or K, since Φ(γ(t)) is strictly increasing. Should both ends of γ be critical points, then we know the index of the critical point at t = −∞ is less than or equal to that of the critical point at t = ∞. Consider the critical points of index 2. For p2 1, . . . , p2 m2 ∈ Crit2(Φ), let Γi = WU(p2 i ). See Fig. 3.1 for a schematic of the trefoil case. Notice each Γi is a union of two tra- jectories leaving p2 i , since the unstable manifold has dimension 1. Since Φ strictly increases along trajectories, we have that the trajectories will either tend to K or to another critical point of index 2 as t → ∞. However, should either end of Γi tend to a critical point q ∈ Crit2(Φ), then the corresponding trajectory will be a submanifold of WS (q). However, WS (q) and Γi are 2 and 1-submanifolds respectively, and for them to intersect transversely as per our assumption, their intersection cannot be more than 0 dimensions. Therefore, we conclude both ends of Γi eventually reach the tubular neighborhood of the knot. Our tunneling is only concerned with the arc outside of the tubular neighborhood, so we can assume Γi is defined only on a compact interval. 46 Figure 3.1: The arcs we add to K are the unstable manifolds associated to critical points of index 2. Note that this diagram does not necessarily depict the specific situation accurately for the trefoil. Now consider the critical points of index 1. For p1 1, . . . , p1 m1 ∈ Crit1(Φ), let Θ j = WS (p1 j). Analogous to before, each Θ j is a union of two trajectories tending to p1 j . Sim- ilar reasoning will tell us that both ends tend to the point at infinity. Indeed, each Θ j is a union of two trajectories tending towards a critical point p1 j of index 1. As t → −∞, Φ will strictly decrease along these trajectories, so we know that the negative infinite limits of these trajectories must either be another critical point of index 1, or the point at infinity. By the transversality assumption of the stable and unstable manifolds, we cannot have that the endpoints ofΘ j are critical points of index 1. Therefore, both ends are at the point at infinity. We may view the union of all the Θ j arcs as a wedge sum of circles at the point of infinity, which we denote m1∨ j=1 Θ j. Notice m1∨ j=1 Θ j is homotopy equivalent to a handlebody. Using a standard maneuver from differential topology, we will flow along the (negative) gradient to perform a deformation retraction from S 3 − (K ∪ Γ1 ∪ · · · ∪ Γm2) to m1∨ j=1 Θ j. We will opt to work with smooth tubular neighborhoods of both of these spaces, but there is a crucial technical lemma we must prove before proceeding. 47 Figure 3.2: A sketch depicting the tubular neighborhood around a regular point of Γi in specially chosen local coordinates which makes the gradient parallel. This is a projection to the xz-plane. 3.3.2 Constructing a smooth boundary around the tunneling In this subsection, we prove the following lemma. Lemma 3.3.1. There are tubular neighborhoods of both K ∪ Γ1 ∪ · · · ∪ Γm2 and m1∨ j=1 Θ j with smooth boundary such that the gradient vector field points inwards and outwards respectively. Proof. It is a standard fact from electrostatics that neighborhoods of K and the point at infinity exist such that the gradient points inwards and outwards respectively. This proof will make use of the tubular flow lemma, which states for every regular point of S 3 − K, there are local coordinates such that the gradient flow takes the constant form ∇(x0,y0,z0)Φ = (1, 0, 0) = ∂ ∂x . Take local coordinates centered at a regular point of Γi so that the portion of the arc inside our coordinate chart is mapped to the path of unit speed along the x-axis, γ(t) = (t, 0, 0). 48 Figure 3.3: A sketch depicting the tubular neighborhood around a critical point on Γi, in specially chosen local coordinates such that Φ is a quadratic with signature (1, 2). This is a projection to the xz-plane. Around a segment of the x-axis, we can choose our tubular neighborhood of Γi to be the interior of the slanted tube defined by y2+ z2 = ( 1 2 x+1)2, where we possibly restrict our local coordinates so −1 < x < 1. See Fig. 3.2. Notice the gradient points inwards from the boundary. If we expand around a regular point on a Θ j arc, we can reflect the tube in the x direction so the gradient points outwards. Note that the properties of a vector field pointing inwards and outwards from a boundary are invariant under a change of coordinate charts in an orientation-preserving atlas. Now suppose we want to take local coordinates around a critical point p2 i of Γi, which has index 2. By Lemma 2.2 of [82], we can find local coordinates cen- tered at the critical point such that Γi corresponds to the x-axis, and Φ takes the form Φ(x0, y0, z0) = 1 2 (x2 0 − y2 0 − z2 0) + c, for some constant c. Thus, ∇Φ takes the form ∇Φ(x0, y0, z0) = (x0,−y0,−z0). Consider the tube around the x-axis defined by y2+ z2 = 1. 49 See Fig. 3.2. Using an abuse of notation, we will refer to this tube as ∂Γi. We will show the gradient points inside the tube, as Figure 3.3 shows. At a point (x0, y0, z0) ∈ ∂Γi, the tangent plane is spanned by (1, 0, 0) and (0,−z0, y0). As the point varies on the tube, this oriented basis of the tangent plane varies smoothly, thus defining an orientation of the tube. The triple of tangent vectors {(1, 0, 0), (0,−z0, y0),∇Φ(x0, y0, z0)} therefore defines a smooth choice of orientation for the three dimensional ambient tangent spaces sur- rounding the tube. Therefore, if the gradient points inside the tube at one point of the tube, it does so throughout the whole tube. For example, at the point p = (0, 1, 0) ∈ ∂Γi, ∇Φ(p) = (0,−1, 0) clearly points inside the tube. The existence of a surface surrounding the arc with which the gradient points inwards is a significant topological obstruction to proving our main theorem. The construction would not be possible if the critical point had index 1, or if we were asked to place a tube around another axis. The case for a critical point of Θi is analogous, and we get the result that the gradient points outwards. By compactness of the arcs Γi and Θ j we only need to construct finitely many tubes around arc segments to encapsulate the entire arc. When the tubes cover the same part of the arc, we may have to shrink the radius of one of the tubes so the boundaries will intersect, but the resulting tube will still have the gradient pointing in the correct direction. By taking the final boundary to be the points of minimal radial distance to the arc, we obtain a connected boundary to the tubular neighborhood that is only piecewise smooth. Likewise, the tubular neighborhoods around the Γi arcs intersect the tube around K, and the gradient points inwards on the boundary of the union. This construction is sufficient to prove the lemma. To get a smooth boundary out of the piecewise smooth boundary, one could either make a density argument in a space of manifolds [88], justify why the theorems work in the piecewise smooth case [82], or use mollifiers to smooth out the kinks [116]. For the sake of space, we omit the 50 technical details. For reasons that will be clear when we perform the deformation retraction, we will want to include a ball around the point at infinity in the tubular neighborhood. In the standard R3 coordinates, this is the complement of a large open ball. We can assume this neighborhood around ∞ contains Φ−1([0, ε]) for some ε > 0. On the boundary sphere, we can again use mollifiers to connect the tube smoothly whilst preserving their orientations against the gradient flow. □ 3.3.3 The deformation retraction to a handlebody The final step is to use the flow of E to perform the deformation retraction. We will use the closed tubular neighborhoods described in Lemma 3.3.1 as an alternative to just the knot with arcs attached and the wedge of circles. Let A be the aforementioned tubular neighborhood of the knot with the Γi arcs attached, and let B denote the afore- mentioned tubular neighborhood of the Θ j arcs connected to a ball around the point at infinity. Let E(x, t) be the flow of the negative gradient. The point E(x, t) refers to the loca- tion of the path at time t starting from the unique integral curve starting at x. First, we prove every point in S 3 − A will eventually flow to B. Suppose x ∈ S 3 − A. There are three possibilities for the limit of the negative gradient flow of x: it could flow to K, it could flow to a critical point of index 1 or 2, or it could flow to the point at infinity. Since the negative gradient points outwards from the boundary of A, x cannot flow to K or a critical point of index 2. Therefore, x either flows to a critical point of index 1, or to the point at infinity, which means that x eventually flows to B. Furthermore, since the negative gradient points into B, once x enters B, it will never leave. This fact still holds even in the vacuous case where m2 = 0 and therefore A = K. 51 By smoothness of the boundary of B, the function which assigns each x ∈ S 3 − A the minimum time t such that E(x, t) ∈ B is smooth. Call this function C(x). Notice C assigns 0 to each point already in B. By compactness of the domain, the func- tion reaches a maximum value Cmax. Define a homotopy H on (S 3 − A) × [0,∞) by H(x, t) = E(x,min(t,C(x))), which we can see is continuous. Also notice that for x ∈ B, H(x, t) = x for all t. Running this homotopy on the time interval [0,Cmax] completes the deformation retraction. This completes the proof of our main theorem. 52 CHAPTER 4 FURTHER TOPOLOGICAL APPROACHES TO KNOTTED ELECTRIC CHARGE DISTRIBUTIONS This chapter orignally appeared as “Topological approaches to knotted electric charge distributions” in Partial Differential Equations and Applications Vol. 4, Iss. 2 [68]. 4.1 Introduction Imagine a closed knotted wire in space fixed in place, with uniform electric charge distributed on it. Our novel problem of interest is to understand the properties of the knot’s electric field, a topic with applications to the study of knotted DNA molecules in viruses [95], the design of electrical circuits [35], and broader questions in physical knot theory [55]. In our work, the knot is fixed in space. By contrast, others have studied how knotted curves deform, as discussed in the extensive literature on the energy methods and flows of knots [12, 34, 89, 137]. The electric field around a charged knot is the negative gradient of a potential function, so the zeroes of the electric field correspond to critical points of the potential. Our primary objects of study are the structurally stable critical points with respect to small perturbations of the charge distribution. Physically, these are the points where all of the electric forces cancel. Thus some basic questions arise: How many critical points are there? What are their local behaviors? And how does the critical set relate to topological properties of the knot? In this paper, we present some results which address these broad questions through the use of techniques from geometric topology, and offer some rigorous 53 proofs for some of the results communicated in [71]. Whilst the methods we apply are standard in low-dimensional topology, research in electrostatics and physical knot theory rarely implement these methods. We use a topological counting argument to show that the size of a knot’s structurally stable critical set sharpens from a previously proven lower bound based on the knot’s tunnel number [65]. Next, we calculate the effect of the Morse surgery on their genera. A critical point of Morse index 1 corre- sponds to an increase in genus as we raise the level of the potential, whilst a critical point of index 2 corresponds to a decrease in genus. The tuple of equipotential sur- face genera is a Scharlemann-Thompson handlebody width decomposition of the knot complement [112], but our tuple differs from the classical thin decomposition in that we exclusively consider those arising from potentials, and we do not lexicographically reorder the tuples. Let K ⊆ R3 ⊆ S 3 be a smooth, tame knot parametrized by the curve r(t), t ∈ [0, 2π] with r(0) = r(2π). We will take the convention that S 3 is the union of R3 and a single compactifying point at infinity. Suppose K is endowed with a uniform charge distri- bution. With a choice of units, the electric potential between a point k ∈ K and a point charge at x at a distance R from k is proportional to R−1 by Coulomb’s Law. It therefore makes sense to define the electric potential Φ : S 3 − K → R, on the complement of K by the line integral Φ(x) = ∫ k∈K dk |x − k| = ∫ 2π 0 |r′(t)| |x − r(t)| dt, x ∈ R3 − K. (4.1) We set Φ(∞) = 0 to ensure smoothness. By differentiating under the integral sign with respect to x, we can see the electric potential is smooth and harmonic. The elec- tric field is defined by E = −∇Φ. The critical points of Φ represent equilibrium points where a charged test particle at rest will continue to experience no electric force from the charge distribution. Some conventions define the potential to be negative, in order 54 for the electric field to point towards the knot, but it is more convenient for our pur- poses to work with a nonnegative potential function. Typically charged wires are pre- sented in physics literature as the integral over a solid, thin torus, where our idealized uniform charge distribution over a curve is approximated by a nonuniform charge dis- tribution over a solid, which depends greatly on the geometry of the parametrization. For background on the physical interpretation of the potential, see [39]. However, our work is concerned with critical points which occur far away from the knot, so this distinction does not alter our results. The electric potential is difficult to analyze in general, as integral formulas can be highly nonlinear and nonsymmetric, but the potential and electric field inte- grals can be numerically approximated via Gaussian quadrature, and the solutions to E(x) = 0 can be determined by the multivariable Newton method. Initial nu- merical approximations of the critical sets for various knots were implemented by Townsend and Lipton [64], showing that there are isotopic parametrizations of the unknots which have critical sets of differing size. In particular, the trivial unknot em- bedding r(t) = (cos(t), sin(t), 0) has a single critical point in the origin, whilst the (5, 1) unknot with parametrization r(t) = (cos(t + 2) cos(5t), cos(t + 2) sin(5t),− sin(t)) has three critical points, as depicted in Figure 1. Figure 4.1: A top-down projection of two knots isotopic to the unknot with differing critical sets. It is unsurprising that the critical set is not invariant under isotopy, given the de- pendence ofΦ on r, but a 2021 result of the author [65] proved there is indeed a restric- tion on the critical set based on a knot invariant called the tunnel number. Let cp(K) 55 denote the smallest size of structurally stable critical sets of K over all parametriza- tions, which includes the critical point at infinity in the count. Let t(K) denote the tunnel number, which is the smallest number of arcs one must attach to K such that the complement is a handlebody. By Waldhausen’s theorem, t(K) is equivalently the smallest number of arcs one must attach such that a tubular neighborhood of the union is isotopic to an unknotted embedding of a handlebody. For example, it is trivial to see t(unknot) = 0, whilst a geometric argument depicted in Fig. 2 of [84] shows t(trefoil) = 1. The aforementioned result of the author is stated as follows. Theorem 4.1.1. For all knots K, cp(K) ≥ 2t(K) + 2. Theorem 4.1.1 was proven using techniques in geometric topology, Morse theory, and stable manifold theory. The proof does not apply Morse reconstruction to Φ itself, but rather, it applies the Morse Rearrangement Lemma (see Lemma 2.4.12 of [88]) to Φ, which allows us to alter the global, long-term dynamical behavior of the gradient to conform to certain convenient properties, whilst leaving the local behavior around the critical points unchanged. Critical sets of harmonic functions can be pathological in general, but the conditions we impose avoid these edge cases, and so the critical sets under consideration will always be finite [114] and lie in the convex hull of the charge distribution [126], which we assume is bounded. The inclusion of the structurally sta- ble requirement in our definition of cp(K) is important, because as we will discuss in Section 2, generic knot isotopies (under the C∞ topology) can induce bifurcations which create or annihilate pairs of critical points, with a structurally unstable degen- erate critical point of high multiplicity occurring during the transition. Demanding that the potential is Morse avoids undercounting the critical points. In Section 3, we prove a partial sharpening of Theorem 4.1.1. The tunnel number is a poor measure of a knot’s complexity. For instance, all torus knots have tunnel num- 56 ber 1, even though they can have arbitrarily high crossing number. Likewise, prime knots with up to 10 crossings can only have tunnel numbers 1 or 2 [85]. However, we can show the bound sharpens when the knot is sufficiently close to a planar curve, defined by the notion of a δ-lifting which is given later on. We then show that given a planar knot diagram with c crossings, there is a sufficiently small δ > 0 such that the size of the critical set for its δ-liftings is at least 2c+2. This result is proven by applying the Poincare-Hopf Index Theorem to the electric field of the projected knot to prove the existence of a critical point in each connected component in the projection’s planar complement, of which there are c+ 2, including the critical point at infinity which cor- responds to the unbounded component. We then use the Morse inequalities to show that lifting each crossing from the projection necessitates the creation of another criti- cal point due to the change in the topology of the knot complement, which yields our final count of 2c + 2. In Section 4, we prove a result describing the topology of the equipotential surfaces in terms of the critical set. The critical values of Φ partition the positive real numbers into finitely many intervals, and a key idea from Morse theory states that the level sets of two values from the same interval are homeomorphic. This leads us to the central object of the section: the Morse code of a knot, which is a tuple of numbers listing the genera of the equipotential surfaces corresponding to each of the intervals. Another key idea from Morse theory states the passage from one equipotential surface to another is determined, up to homotopy equivalence, by the attachment of a disc of appropriate dimension. Even though the attachment maps can be quite pathological in general, we prove that the potential yields particularly simple surgeries, showing that a critical point of index 1 corresponds to an increase in the subsequent term of the Morse code, whilst a critical point of index 2 corresponds to a decrease. 57 4.2 Bifurcations through degenerate critical points A strategy for finding more precise statements than Theorem 3.1.1 is to bifurcate the electric potential’s critical set via knot isotopies into a set with a sharper lower bound. One such class of isotopies proposed by Lipton-Strogatz-Townsend is the “flattenings,” where one of the coordinate functions is scaled by the parameter s, as s ranges from 1 to 0. The final state is a self-intersecting planar curve, but we opt to con- tinue referring to this process as an ambient isotopy in an abuse of notation. In [71], an article written for a general audience of physicists, the authors implement numerical simulations of charged knots yielding critical sets whose sizes exceed the proven the- oretical lower bound. In this section, we discuss the general fashion with which these critical sets transition from one to another as the electric potential bifurcates. Essen- tially, as the bifurcation is through harmonic functions, this constrains the transitions to be the creation or destruction of pairs, modulo structurally unstable symmetries arising from the particular parametrization and isotopy. A knot isotopy induces a path in C∞(M) via the corresponding potentials, but the path could pass through non-Morse functions. The proofs of our theorems in the subsequent sections make liberal use of the Morse property, and break down when Φ is not Morse. However, using Cerf theory, we can extract a generic property of knot isotopies in which the path through non-Morse potentials occurs only finitely many times, and their degenerate critical points can be classified based on partial derivatives of the isotopy. We will see that these degenerate critical points correspond to saddle-node bifurcations. It should be emphasized that this result does not describe the dynamics of a moving charged knot, as we do not take into account the induced magnetic forces from the moving charges. Rather, we are describing a bifurcation among a smooth family of possible fixed, rigid knot configurations. 58 We now state a part of the Cerf Structure Theorem in terms of the homotopies of Morse functions. There is a more general structure theorem given by a stratification [88, 33] of the class of isotopies via the coordinate expansion of general degenerate critical points. A full exposition can be found in Ch. 23 of Freed [33]. Theorem 4.2.1. Let F̃ : M × [0, 1] → R be a one parameter family of smooth functions on an n−manifold M indexed by the variable s. We can replace F̃ with a substitute function (which we still call F̃) in a dense subspace of C∞(M × [0, 1]) which satisfies the following. Each Fs = F̃(−, s) is Morse except for finitely many s where there could exist degenerate critical points. For each such s0, and for each degenerate critical point p of Fs0 , there exist local coordinates (x1, . . . , xn, s) of M × [0, 1] centered at (p, s0) such that F̃(x1, . . . , xn, s) = x3 1 + ε1sx1 + ε2x2 2 + · · · + εnx2 n + C, where each εi ∈ {±1}, whose values depend on the partial derivatives of F̃, and C is the degenerate critical value. ε1 ε2 ε3 Bifurcation for increasing s 1 1 1 Impossible 1 1 −1 Destruction of critical pair 1 −1 1 Destruction of critical pair 1 −1 −1 Impossible −1 1 1 Impossible −1 1 −1 Creation of critical pair −1 −1 1 Creation of critical pair −1 −1 −1 Impossible Figure 4.2: The generic classification of bifurcations of critical points of the potential induced by knot isotopy. We will apply this theorem to the one parameter family of electric potentials in- duced by a knot isotopy. Suppose K0 and K1 are two smooth isotopic knots where the isotopy is parametrized by r : [0, 2π] × [0, 1] → R3. For fixed s0 ∈ [0, 1], we will say rs0 = r(−, s0) is the parametrization of the knot Ks0 , whose electric potential is Φs0 . We will now apply Theorem 4.2.1 to the one parameter familyΦs, and if necessary, replace 59 Φs with another nearby generic family so that the property mentioned in the theorem holds. We will label the coordinates from Theorem 4.2.1 as (x, y, z, s) so we have that the local coordinate formula of Φ at a specific s0 where Φs0 has a degenerate critical point is Φ(x, y, z, s) = x3 + ε1sx + ε2y2 + ε3z2 +C. Remark 4.2