Remarks on the disproof of the unit distance conjecture
Abstract: We present a short, digested, human-verified version of the recent OpenAI-generated counterexample to the Erdős unit distance conjecture, and a sequence of reflections on it. The argument relies crucially on ideas that may, at least in retrospect, be attributed to Ellenberg-Venkatesh, Golod-Shafarevich, and Hajir-Maire-Ramakrishna.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Explain it Like I'm 14
What is this paper about?
This paper explains a surprising discovery about a famous geometry question first asked by the mathematician Paul Erdős: If you put n points in the plane, how many pairs can be exactly distance 1 apart? For decades, many experts believed the answer should be “almost linear,” roughly like . This paper presents a clear, human-checked version of an AI-generated construction that disproves that belief. It shows there are point sets with at least unit-distance pairs for some fixed positive number . In short: you can have many more “unit-length” pairs than people thought.
The big questions the paper tackles
- Can we build large point sets in the plane with way more than linear (in n) many unit-distance pairs?
- Is Erdős’s long-standing “almost linear” upper bound for unit distances false?
- Can ideas from number theory (a branch of math about whole numbers and their generalizations) help build these surprising point sets?
How did they approach it? (The main ideas, in everyday language)
The authors use a clever bridge between number theory and geometry. Here’s the idea in simple terms:
- Think “high-dimensional grid” instead of a simple 2D grid: Mathematicians can build very large, well-structured grids of points not just in the plane, but in a much higher-dimensional space that behaves nicely. These grids come from special number systems called number fields.
- Find many “unit steps” on the boundary: In these high-dimensional grids, the authors look for many special step vectors whose length (in the complex plane) is exactly 1. Think of them as many “arrows” you can add to a point without changing the arrow’s length.
- Window-and-shadow trick: They focus on a big window (a large box) in this high-dimensional grid and then project (take a “shadow” of) these points onto one complex coordinate. That one complex coordinate is just another way to view the usual plane (). If there are many unit-length arrows inside the window, then after projection you get many pairs of points exactly distance 1 apart in the plane.
Two key ingredients make this work:
- A geometry-of-numbers lemma: It says that if your big grid has many “unit-length directions” on its boundary, then by choosing a suitable window and projecting to one coordinate, you get a planar point set with lots of unit-distance pairs.
- A pigeonhole-principle lemma (from number theory): It guarantees there are many special numbers of absolute value 1 (think “perfect unit-length arrows” in all the right ways) by cleverly combining many primes that “split nicely” in the bigger number system. This is where deep algebraic number theory comes in.
To supply enough of these special numbers, the authors use:
- CM fields and embeddings: A CM field is a special kind of number field where “being length 1 in one view” automatically means “length 1 in all views.” That makes the unit-length condition robust.
- Class field towers with bounded complexity: Using a powerful result (Golod–Shafarevich), they build an infinite “tower” of number fields that keep certain measurements (called root discriminants) under control while the dimension grows. As the dimension grows, you get more room for the window-and-shadow trick to produce many unit distances.
- Split primes: A fixed rational prime (like 101) that splits into many smaller pieces in these fields helps create many of those unit-length arrows. The more it splits as the dimension grows, the more arrows you get.
Putting all this together, they construct ever larger planar point sets whose number of unit-distance pairs grows faster than by a fixed power.
What did they find, and why is it important?
Main result:
- There is a sequence of point sets in the plane with size going to infinity where the number of unit-distance pairs is at least for some fixed .
- This disproves the long-standing conjecture that the maximum number of unit distances among points is at most .
Why this matters:
- It overturns a widely believed upper bound and changes how we think about distances among points.
- It shows that deep tools from number theory can solve problems in discrete geometry in unexpected ways.
- It demonstrates that AI can suggest powerful, non-obvious constructions, which humans can then verify and simplify.
What could this mean for the future?
- New connections: This work strengthens the link between number theory and geometry. Similar techniques might help tackle other tough problems about distances, angles, or patterns among points.
- Better constructions: The “high-dimensional grid + projection” method, powered by number fields and split primes, may yield new geometric configurations with extreme or optimal properties.
- AI and mathematics: The original idea came from an AI system. Humans refined and verified it. This hints at a future where AI proposes bold approaches, and mathematicians turn those ideas into clear, reliable results.
In short, the paper shows that thinking far outside the usual 2D grid—by using advanced number systems and high-dimensional structures—can produce far more unit distances than expected. It’s a striking example of cross-pollination between fields and a milestone for AI-assisted discovery in mathematics.
Knowledge Gaps
Knowledge gaps, limitations, and open questions
Below is a focused list of what the paper leaves unresolved, uncertain, or unexplored, phrased to be actionable for future work.
- Effectivity and explicit constructions
- Provide explicit algorithms to construct the fields in the class field tower, the split primes, and the resulting point sets, with concrete bounds on sizes and running time (e.g., produce a point set with n up to a practical threshold and verify the unit-distance count).
- Make the use of Chebotarev effective: quantify the time/height needed to find split primes with the required congruence (e.g., q ≡ 1 mod 4) and give explicit splitting density statements along the tower.
- Size of the improvement over 1 and optimization of parameters
- The achieved exponent above 1 is astronomically small (e.g., “≈ 1 + 6.24 × 10{-38}” as written). Derive optimized choices of T, S, k_j, p, window shape, and lattice scaling that substantially increase the constant ε in |P|{1+ε}, ideally to a quantitatively meaningful value.
- Replace the crude bound h_K ≤ |Disc K| with sharper class number bounds (possibly conditional, e.g., GRH-based) to enlarge u in Lemma 1 and improve the exponent.
- Minimize the denominator D in Lemma 2 while maintaining a large |U|; for example, replace the pigeonhole on ideal classes with a pigeonhole over ray class groups, use controlled conductors, or impose congruence conditions to get many elements of modulus 1 with much smaller denominators.
- Geometry-of-numbers and lattice-analytic choices
- Explore alternative window shapes and norms (e.g., Euclidean balls instead of sup-norm polydiscs, or balanced fundamental domains via Minkowski/LLL reduction) to reduce the skewness parameter v and improve counts.
- Give general criteria ensuring the coordinate projection is injective for broader classes of lattices (beyond scaled O_K) and quantify how close-to-injective projections impact the lower bound when collisions occur.
- Optimize the averaging/unfolding step in Lemma 1 (choice of translate a) to improve constants and reduce boundary losses.
- Number-field structural constraints
- The method currently requires CM fields to ensure |u| = 1 across all complex embeddings. Develop extensions that work in non-CM settings (e.g., controlling other archimedean embeddings, or using different constructions that certify unit modulus in the projected coordinate only).
- Quantify the trade-off between bounded root discriminant and the availability/density of split rational primes in infinite towers. Identify towers with minimal root discriminant r that maximize |U| and minimize D.
- Make Proposition 1 fully explicit: construct particular towers with infinitely many q ≡ 1 mod 4 split primes and give quantitative splitting densities and effective heights.
- Structural properties of the resulting point sets
- Characterize the unit-distance graph produced: quantify typical degrees, forbidden subgraphs (e.g., do K_{2,3}-type constraints persist?), and incidence patterns. Assess whether the construction also forces many occurrences of other distances.
- Investigate whether these sets interact with known incidence bounds (e.g., crossing number methods) in a way that hints at new combinatorial phenomena or upper-bound refinements.
- Scope of the phenomenon and generalizations
- Extend the method to higher dimensions (Rd), other norms (e.g., ℓ_p or generic norms), or other prescribed distances (beyond 1), and identify which arithmetic features are essential for each extension.
- Analyze whether similar number-theoretic constructions yield large lower bounds for related problems (e.g., distinct distances, repeated distances at multiple prescribed radii), and identify the barriers.
- Quantitative counting in towers
- Develop asymptotic formulas (not just lower bounds) for the number of magnitude-1 elements with constrained denominators in CM towers as [K:Q] → ∞; quantify the exponential growth rate and dependence on ramification/splitting.
- Systematically optimize the choice of s, k_j, and the set of rational primes used in Lemma 2 to maximize |U| subject to constraints on D and root discriminant.
- Verification and empirical support
- Produce computational experiments for moderately sized fields to instantiate the construction, verify injectivity, count unit pairs, and compare observed exponents with theoretical predictions.
- Check the numerical estimate leading to “≈ 1 + 6.24 × 10{-38}” for arithmetic or typographical errors and give a transparent derivation of ε for specific parameter choices.
- Arithmetic refinements and alternative sources of modulus-1 elements
- Explore generating modulus-1 elements via controlled S-unit constructions or special values of Hecke characters to increase |U| without inflating denominators.
- Investigate whether towers with prescribed ray class constraints can yield larger families of unit-modulus elements and better lattice packing properties in Cf.
- Limitations of the approach
- Assess whether this number-theoretic pathway has intrinsic barriers to achieving ε of order 1/log n or larger, and identify what would be required (new arithmetic input, different geometric strategy) to significantly break the current “existential-but-tiny” ε ceiling.
- Evaluate the necessity of deep class field tower machinery: can simpler or more combinatorial constructions produce comparable or stronger lower bounds without heavy algebraic number theory?
Practical Applications
Overview
This paper presents a human-verified, streamlined account of an AI-generated counterexample to the Erdős unit distance conjecture. The core technical contributions are:
- A geometry-of-numbers “window” lemma to turn high-dimensional lattice structure into many unit distances in the plane.
- A pigeonhole/class-group lemma that produces many algebraic numbers of modulus 1 with bounded denominators in CM fields.
- A scalable construction via class field towers (Golod–Shafarevich type) with bounded root discriminant and plentiful split primes (including a Frobenius-cutting strategy).
These ideas enable explicit, scalable constructions of planar point sets with more than n{1+o(1)} unit distances. Beyond settling a famous conjecture, the methods yield immediate utilities (e.g., benchmark generators and robustness testing for geometric algorithms) and long-term avenues (e.g., code/lattice design, AI-assisted proof tooling, and policy for AI-generated mathematics).
Below are practical applications, grouped by deployment timeline.
Immediate Applications
The following applications can be prototyped or deployed now using standard tools (SageMath, PARI/GP, Magma, CGAL, networkx, shapely/GEOS) and existing workflows.
- Robustness and adversarial benchmarking for computational geometry
- Sector: software, robotics, GIS/CAD
- Use case: Generate planar point sets with unusually many equal distances to stress-test algorithms (Delaunay/Voronoi, nearest-neighbor, R-/KD-trees, collinearity/degeneracy handling, snapping, clustering).
- Product/workflow: A point-set generator that constructs CM fields K with specified split primes, embeds O_K into Cf (Minkowski embedding), applies the bounded “window,” and projects to 2D; outputs adversarial datasets for benchmarking.
- Assumptions/dependencies: Availability of number-field computations (SageMath/PARI/GP) and basic numerical stability; performance tuning for large degrees.
- Worst-case and regression tests for unit disk graph algorithms
- Sector: telecommunications, wireless networking, graph analytics
- Use case: Evaluate algorithms for scheduling, channel assignment, coloring, clique/independent-set approximations on dense unit disk graphs induced by these constructions.
- Product/workflow: Synthetic UDG generators integrated with network simulators.
- Assumptions/dependencies: Sufficiently large instances may require memory-aware representations; need reproducible seeds and parameterizations for fair comparisons.
- Stress-testing SLAM, localization, and range-based sensing pipelines
- Sector: robotics, autonomous systems, AR/VR
- Use case: Construct datasets that exacerbate distance ambiguities (many equal distances) to test identifiability, loop closure, and outlier rejection.
- Product/workflow: Plug-in dataset generator for SLAM frameworks; evaluate failure modes and design degeneracy detectors.
- Assumptions/dependencies: Primarily 2D; extension to 3D requires analogous constructions (still feasible via number-field embeddings in higher dimensions).
- CAD/GIS UI robustness and snapping/tolerance policy
- Sector: design/engineering software, mapping
- Use case: Validate snapping rules, constraint solvers, and geometric predicates under extreme equal-distance degeneracy.
- Product/workflow: Regression test suites integrating generated point sets; automated detection of ambiguous snapping.
- Assumptions/dependencies: Numeric robustness and predicate exactness (e.g., via rational/algebraic kernels).
- Academic toolkits for discrete geometry and extremal combinatorics
- Sector: academia (math/CS)
- Use case: Ready-to-use libraries for constructing high-unit-distance sets for experiments, lectures, and assignments.
- Product/workflow: Open-source package (Sage/Python) encapsulating Lemma L:unit_expansion and Lemma “pigeons,” with presets for class field towers and split primes.
- Assumptions/dependencies: Documentation and precomputed field data to lower entry barriers.
- Education and research assistance via “hint sequencing”
- Sector: education, software
- Use case: LLM-driven proof assistants that provide graded “hint sequences,” inspired by the paper’s discussion of “Kolmogorov complexity modulo experts.”
- Product/workflow: Plugins for Lean/Isabelle/Coq or Jupyter notebooks that deliver surprise-weighted hints, track hint length, and support human-in-the-loop proof discovery.
- Assumptions/dependencies: Availability of LLMs fine-tuned on formal/informal proofs; institutional willingness to integrate AI guidance.
- Journal and grant-policy templates for AI-generated mathematics
- Sector: policy, academia, publishing
- Use case: Introduce verification pipelines, provenance statements, and reproducibility checklists for AI-aided results.
- Product/workflow: Editorial policies for disclosure of AI involvement; artifact evaluation (code, field data, generators).
- Assumptions/dependencies: Community consensus on standards; tooling for verifiable archives.
- Benchmark families for code and lattice experiments (replication of known results)
- Sector: coding theory, sphere packing (R&D)
- Use case: Use known class-field-tower lattices (referenced in the paper) to replicate/compare code/packing properties.
- Product/workflow: Scripts to generate lattices from towers; compare distance spectra, density, and decoding performance.
- Assumptions/dependencies: Existing literature provides constructions; this paper’s perspective informs parameter choices but does not (by itself) guarantee improvements over best-known codes.
Long-Term Applications
These applications likely require further research, scaling, or integration with domain-specific constraints and performance targets.
- Space-time and constant-modulus code design from CM fields
- Sector: telecommunications (MIMO, 6G), satellite/IoT
- Use case: Explore codebooks derived from CM fields with many modulus-1 elements to reduce PAPR while maintaining diversity/determinant criteria.
- Product/workflow: Algebraic code design pipeline that integrates the paper’s modulus-1 constructions with minimum-determinant constraints and efficient decoding.
- Assumptions/dependencies: Proof-of-concept analyses of coding gain, complexity, and robustness; bridging to NVD and shaping gains.
- Improved high-dimensional lattice packings and coding schemes via class field towers
- Sector: communications, storage, error-correction
- Use case: Leverage bounded root discriminant towers and split primes to engineer lattices/codes with favorable asymptotics (building on prior works cited in the paper).
- Product/workflow: Parametric exploration (dimension, discriminant, splitting patterns) with automated search; practical encoders/decoders.
- Assumptions/dependencies: Strong algorithmic support for huge-degree fields; hardware-accelerated decoding; demonstration of net system gains.
- Hard-instance generators for geometric ML models
- Sector: AI/ML robustness, AV/robotics
- Use case: Systematically produce adversarial 2D/3D point clouds with repeated-distance degeneracies to test graph-based or attention-based models in perception and mapping.
- Product/workflow: Integration with point-cloud training pipelines; robustness metrics tied to geometric degeneracy.
- Assumptions/dependencies: Extension from 2D constructions to 3D analogues; scalable generation at dataset sizes.
- New algorithmic heuristics informed by revised unit-distance extremals
- Sector: network planning, geometric optimization
- Use case: Revisit heuristics and worst-case analyses for unit-disk and proximity graphs with updated understanding of extremal structures.
- Product/workflow: Heuristic tuning that exploits structure (e.g., anticipated clustering on “window boundaries”); algorithm selection policies based on degeneracy detectors.
- Assumptions/dependencies: Empirical demonstrations that new constructions change performance-phase diagrams in realistic workloads.
- Cryptographic research directions based on controlled splitting in infinite towers
- Sector: cryptography (exploratory)
- Use case: Investigate whether class-group/ideal problems over carefully engineered towers (bounded root discriminant, many split primes) yield hard instances or novel primitives.
- Product/workflow: Hardness studies; parameterization guides; potential class-group action or isogeny analogues in broader settings.
- Assumptions/dependencies: Rigorous security reductions; quantum-era threat models; careful avoidance of trapdoors introduced by engineered splitting.
- Formal proof co-pilots with surprisal-weighted hint optimization
- Sector: education, research tooling
- Use case: Operationalize “Kolmogorov complexity modulo experts” to measure and minimize hint length while maximizing learning/insight.
- Product/workflow: A/B-tested systems for classrooms and research groups; dashboards scoring hint sequences and proof-reconstruction times.
- Assumptions/dependencies: Large-scale user studies; alignment with formal proof libraries; ethical/assessment frameworks.
- Standards for AI-human collaboration and credit in mathematics
- Sector: policy, academia
- Use case: Develop frameworks for attribution, authorship, and responsibility when AI contributes nontrivially to proofs.
- Product/workflow: Best-practice documents, data-sharing norms, audit trails for model prompts/outputs.
- Assumptions/dependencies: Broad stakeholder buy-in (societies, journals, funders).
- Sensor-network localization and range-only system design
- Sector: IoT, industrial automation
- Use case: Use extremal equal-distance constructions to establish identifiability limits and to design anchor layouts resistant to ambiguity.
- Product/workflow: Layout optimization tools that avoid “unit-distance pathologies”; certification tests before deployment.
- Assumptions/dependencies: Translation of 2D theory to specific sensor noise models and deployment constraints.
- Curriculum and training on cross-pollination between discrete geometry and number theory
- Sector: education (graduate/advanced undergraduate)
- Use case: New modules illustrating how deep number theory yields geometric extremals; pipelines from class field theory to combinatorial constructions.
- Product/workflow: Course materials and computational labs (Sage/Magma) pairing proofs with computational experiments.
- Assumptions/dependencies: Faculty expertise; curated datasets and field instances to lower the learning curve.
Notes on feasibility across items:
- Many industry-facing uses rely on a packaged generator for “many equal distances” point sets; this is straightforward given the paper’s lemmas and standard number-theory software, though very large instances may be computationally heavy.
- Code/lattice applications require significant additional constraints beyond “many modulus-1 elements,” including diversity criteria and practicality of decoding.
- AI-in-math policy and education tools depend more on community processes than on technical barriers, so early pilots are realistic.
Glossary
- Archimedean valuation: An absolute value coming from an embedding of a number field into the real or complex numbers; used to measure sizes at “infinite” places. Example: "small archimedian valuations and also bounded denominator."
- Chebotarev density theorem: A theorem describing the distribution of Frobenius elements and splitting of primes in Galois extensions. Example: "Every time the Chebotarev density theorem is applied to obtain a split rational prime of a number field , we instead apply the Chebotarev density theorem to the number field ."
- Class field tower: An ascending sequence of number fields where each extension is the (maximal) unramified abelian extension of the previous field. Example: "be a finite layer of an infinite class field tower of Golod-Shafarevich type"
- Class group: The group of ideal classes of a number field’s ring of integers, measuring failure of unique factorization. Example: "Use the pigeonhole principle on the classes in "
- Class number: The size of the class group of a number field; a central invariant in algebraic number theory. Example: "in terms of the class number "
- CM field: A totally imaginary quadratic extension of a totally real number field (a special type of number field with complex multiplication). Example: "It turns out to be convenient to assume is a CM field."
- Complex conjugation automorphism: The nontrivial automorphism c of a CM field K sending each element to its complex conjugate. Example: "with complex conjugation automorphism ."
- Covolume: The volume of a fundamental domain of a lattice in a Euclidean space. Example: "Let ."
- Dirichlet's unit theorem: Describes the structure and rank of the unit group of a number field’s ring of integers. Example: "Dirichlet's unit theorem shows that "
- Discriminant: A number field invariant encoding ramification and arithmetic complexity; the root discriminant is its normalized version. Example: "the main enemies, the class number and discriminant "
- Frobenius element: For an unramified prime, a canonical element of the Galois group capturing its action on residue fields, central in Chebotarev theory. Example: "the Frobenius elements at the finite places of ."
- Frattini quotient: The quotient of a (profinite) group by its Frattini subgroup; its dimension measures the minimal number of generators. Example: "(i.e. the Frattini quotient)."
- Galois group: The group of field automorphisms of a Galois extension; encodes the symmetry of the extension. Example: "Let be the Galois group of the maximal pro-$2$ extension of "
- Geometry of numbers: The study of lattice points and convex bodies in Euclidean space, often using volume and counting arguments. Example: "First, we give the geometry-of-numbers lemma."
- Golod–Shafarevich theorem: A result giving criteria for certain pro-p groups (and thus class field towers) to be infinite. Example: "and is infinite by the Golod-Shafarevich Theorem \cite{GolodShafarevich1964ClassFieldTower}."
- Minkowski embedding: The canonical embedding of a number field into a product of real and complex spaces, turning the ring of integers into a lattice. Example: "the covolume of in the Minkowski embedding of in is ."
- Minkowski sum: The set sum A+B = {a+b} of two subsets of a vector space; used in convex and additive geometry. Example: "the sumset (Minkowski sum) of and "
- Number field: A finite-degree field extension of the rational numbers Q. Example: "Let be a number field embedded in ."
- Polydisc: A Cartesian product of discs in complex Euclidean space. Example: "a polydisc of ``radius'' ."
- Prime ideal: A prime in the ring of integers of a number field; basic building block for ideal factorization. Example: "Let be pairwise distinct prime ideals of "
- Principal ideal: An ideal generated by a single element in a ring. Example: "pairwise distinct principal ideals "
- Pro-2 extension: A Galois extension whose Galois group is a pro-2 group (inverse limit of 2-groups). Example: "the maximal pro-$2$ extension of "
- Ramification index: The multiplicity with which a prime ideal in a base field appears in the factorization of a prime in an extension. Example: "Let denote the ramification index of a prime ideal in a number field ."
- Ray class field: The maximal abelian extension of a number field associated to a given modulus; central object in class field theory. Example: "equiangular lines, ray class fields, and the Stark conjectures"
- Root discriminant: The discriminant of a number field raised to the power 1/[K:Q]; used to compare discriminants across degrees. Example: "have bounded root discriminant"
- Split completely: A property where a rational prime factors into distinct degree-1 primes in an extension field. Example: "split completely at all primes in "
- Sup-norm: The maximum absolute value among coordinates of a vector; here, across embeddings. Example: "the sup-norm over all conjugate absolute values on ."
- Tame ramification: Ramification where the ramification index is not divisible by the residue characteristic. Example: "they are ramified tamely"
- Totally real field: A number field whose embeddings into C all land in R. Example: "where is a totally real number field"
- Weil height: A canonical height function measuring the arithmetic size of algebraic numbers. Example: "of bounded Weil height"
Collections
Sign up for free to add this paper to one or more collections.