Goto

Collaborating Authors

 elliptic curve


A 'Grand Unified Theory' of Math Just Got a Little Bit Closer

WIRED

The original version of this story appeared in Quanta Magazine. In 1994, an earthquake of a proof shook up the mathematical world. The mathematician Andrew Wiles had finally settled Fermat's Last Theorem, a central problem in number theory that had remained open for over three centuries. The proof didn't just enthral mathematicians--it made the front page of The New York Times. But to accomplish it, Wiles (with help from the mathematician Richard Taylor) first had to prove a more subtle intermediate statement--one with implications that extended beyond Fermat's puzzle.


Learning Euler Factors of Elliptic Curves

Babei, Angelica, Charton, François, Costa, Edgar, Huang, Xiaoyu, Lee, Kyu-Hwan, Lowry-Duda, David, Narayanan, Ashvni, Pozdnyakov, Alexey

arXiv.org Artificial Intelligence

We apply transformer models and feedforward neural networks to predict Frobenius traces $a_p$ from elliptic curves given other traces $a_q$. We train further models to predict $a_p \bmod 2$ from $a_q \bmod 2$, and cross-analysis such as $a_p \bmod 2$ from $a_q$. Our experiments reveal that these models achieve high accuracy, even in the absence of explicit number-theoretic tools like functional equations of $L$-functions. We also present partial interpretability findings.


Mathematical Data Science

Douglas, Michael R., Lee, Kyu-Hwan

arXiv.org Artificial Intelligence

In this article we discuss an approach to doing this which one can call mathematical data science. In this paradigm, one studies mathematical objects collectively rather than individually, by creating datasets and doing machine learning experiments and interpretations. Broadly speaking, the field of data science is concerned with assembling, curating and analyzing large datasets, and developing methods which enable its users to not just answer predetermined questions about the data but to explore it, make simple descriptions and pictures, and arrive at novel insights. This certainly sounds promising as a tool for mathematical discovery! Mathematical data science is not new and has historically led to very important results. A famous example is the work of Birch and Swinnerton-Dyer leading to their conjecture [BSD65], based on computer generation of elliptic curves and linear regression analysis of the resulting data. However, the field really started to take off with the deep learning revolution and with the easy access to ML models provided by platforms such as Py-Torch and TensorFlow, and built into computer algebra systems such as Mathematica, Magma and SageMath.


Machine Learning Approaches to the Shafarevich-Tate Group of Elliptic Curves

Babei, Angelica, Banwait, Barinder S., Fong, AJ, Huang, Xiaoyu, Singh, Deependra

arXiv.org Machine Learning

We train machine learning models to predict the order of the Shafarevich-Tate group of an elliptic curve over $\mathbb{Q}$. Building on earlier work of He, Lee, and Oliver, we show that a feed-forward neural network classifier trained on subsets of the invariants arising in the Birch--Swinnerton-Dyer conjectural formula yields higher accuracies ($> 0.9$) than any model previously studied. In addition, we develop a regression model that may be used to predict orders of this group not seen during training and apply this to the elliptic curve of rank 29 recently discovered by Elkies and Klagsbrun. Finally we conduct some exploratory data analyses and visualizations on our dataset. We use the elliptic curve dataset from the L-functions and modular forms database (LMFDB).


LOCR: Location-Guided Transformer for Optical Character Recognition

Sun, Yu, Zhou, Dongzhan, Lin, Chen, He, Conghui, Ouyang, Wanli, Zhong, Han-Sen

arXiv.org Artificial Intelligence

Academic documents are packed with texts, equations, tables, and figures, requiring comprehensive understanding for accurate Optical Character Recognition (OCR). While end-to-end OCR methods offer improved accuracy over layout-based approaches, they often grapple with significant repetition issues, especially with complex layouts in Out-Of-Domain (OOD) documents.To tackle this issue, we propose LOCR, a model that integrates location guiding into the transformer architecture during autoregression. We train the model on a dataset comprising over 77M text-location pairs from 125K academic document pages, including bounding boxes for words, tables and mathematical symbols. LOCR adeptly handles various formatting elements and generates content in Markdown language. It outperforms all existing methods in our test set constructed from arXiv, as measured by edit distance, BLEU, METEOR and F-measure.LOCR also reduces repetition frequency from 4.4% of pages to 0.5% in the arXiv dataset, from 13.2% to 1.3% in OOD quantum physics documents and from 8.1% to 1.8% in OOD marketing documents. Additionally, LOCR features an interactive OCR mode, facilitating the generation of complex documents through a few location prompts from human.


Murmurations of elliptic curves

He, Yang-Hui, Lee, Kyu-Hwan, Oliver, Thomas, Pozdnyakov, Alexey

arXiv.org Machine Learning

We investigate the average value of the $p$th Dirichlet coefficients of elliptic curves for a prime p in a fixed conductor range with given rank. Plotting this average yields a striking oscillating pattern, the details of which vary with the rank. Based on this observation, we perform various data-scientific experiments with the goal of classifying elliptic curves according to their ranks.


Bitcoin & The "Threat" of Quantum Computing

#artificialintelligence

In recent years there has been a growing interest in the field of Quantum Computing. It is a novel and incredibly complex topic that has turned into a hot button item. . If it lives up to it's potential Quantum Computing has the potential to be a truly paradigm shifting and industry disrupting [if not destroying] technology. More recently, Quantum Computing, specifically Quantum Computers, have been identified as the inevitable downfall of Bitcoin. The impossible to comprehend processing power, the sheer inexorable force they will bring to bear on the world will be a force unlike we have ever seen, one that no computer system can resist. Quantum Computers are the Boogie Man that every Bitcoin HODLer, Trader, Pundit, and Investor should stay up at night worrying about.


Machine-Learning Arithmetic Curves

He, Yang-Hui, Lee, Kyu-Hwan, Oliver, Thomas

arXiv.org Machine Learning

We show that standard machine-learning algorithms may be trained to predict certain invariants of low genus arithmetic curves. Using datasets of size around one hundred thousand, we demonstrate the utility of machine-learning in classification problems pertaining to the BSD invariants of an elliptic curve (including its rank and torsion subgroup), and the analogous invariants of a genus 2 curve. Our results show that a trained machine can efficiently classify curves according to these invariants with high accuracies (>0.97). For problems such as distinguishing between torsion orders, and the recognition of integral points, the accuracies can reach 0.998.


Machine-Learning the Sato--Tate Conjecture

He, Yang-Hui, Lee, Kyu-Hwan, Oliver, Thomas

arXiv.org Machine Learning

We apply some of the latest techniques from machine-learning to the arithmetic of hyperelliptic curves. More precisely we show that, with impressive accuracy and confidence (between 99 and 100 percent precision), and in very short time (matter of seconds on an ordinary laptop), a Bayesian classifier can distinguish between Sato-Tate groups given a small number of Euler factors for the L-function. Our observations are in keeping with the Sato-Tate conjecture for curves of low genus. For elliptic curves, this amounts to distinguishing generic curves (with Sato-Tate group SU(2)) from those with complex multiplication. In genus 2, a principal component analysis is observed to separate the generic Sato-Tate group USp(4) from the non-generic groups. Furthermore in this case, for which there are many more non-generic possibilities than in the case of elliptic curves, we demonstrate an accurate characterisation of several Sato-Tate groups with the same identity component. Throughout, our observations are verified using known results from the literature and the data available in the LMFDB. The results in this paper suggest that a machine can be trained to learn the Sato-Tate distributions and may be able to classify curves much more efficiently than the methods available in the literature.


Machine Learning meets Number Theory: The Data Science of Birch-Swinnerton-Dyer

Alessandretti, Laura, Baronchelli, Andrea, He, Yang-Hui

arXiv.org Machine Learning

Empirical analysis is often the first step towards the birth of a conjecture. This is the case of the Birch-Swinnerton-Dyer (BSD) Conjecture describing the rational points on an elliptic curve, one of the most celebrated unsolved problems in mathematics. Here we extend the original empirical approach, to the analysis of the Cremona database of quantities relevant to BSD, inspecting more than 2.5 million elliptic curves by means of the latest techniques in data science, machine-learning and topological data analysis. Key quantities such as rank, Weierstrass coefficients, period, conductor, Tamagawa number, regulator and order of the Tate-Shafarevich group give rise to a high-dimensional point-cloud whose statistical properties we investigate. We reveal patterns and distributions in the rank versus Weierstrass coefficients, as well as the Beta distribution of the BSD ratio of the quantities. Via gradient boosted trees, machine learning is applied in finding inter-correlation amongst the various quantities. We anticipate that our approach will spark further research on the statistical properties of large datasets in Number Theory and more in general in pure Mathematics.