Goto

Collaborating Authors

 mathematical formulae


Neural Machine Translation for Mathematical Formulae

arXiv.org Artificial Intelligence

We tackle the problem of neural machine translation of mathematical formulae between ambiguous presentation languages and unambiguous content languages. Compared to neural machine translation on natural language, mathematical formulae have a much smaller vocabulary and much longer sequences of symbols, while their translation requires extreme precision to satisfy mathematical information needs. In this work, we perform the tasks of translating from LaTeX to Mathematica as well as from LaTeX to semantic LaTeX. While recurrent, recursive, and transformer networks struggle with preserving all contained information, we find that convolutional sequence-to-sequence networks achieve 95.1% and 90.7% exact matches, respectively.


Minimalist Data Wrangling with Python

arXiv.org Artificial Intelligence

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.


Trader-Company Method: A Metaheuristic for Interpretable Stock Price Prediction

arXiv.org Machine Learning

Investors try to predict returns of financial assets to make successful investment. Many quantitative analysts have used machine learning-based methods to find unknown profitable market rules from large amounts of market data. However, there are several challenges in financial markets hindering practical applications of machine learning-based models. First, in financial markets, there is no single model that can consistently make accurate prediction because traders in markets quickly adapt to newly available information. Instead, there are a number of ephemeral and partially correct models called "alpha factors". Second, since financial markets are highly uncertain, ensuring interpretability of prediction models is quite important to make reliable trading strategies. To overcome these challenges, we propose the Trader-Company method, a novel evolutionary model that mimics the roles of a financial institute and traders belonging to it. Our method predicts future stock returns by aggregating suggestions from multiple weak learners called Traders. A Trader holds a collection of simple mathematical formulae, each of which represents a candidate of an alpha factor and would be interpretable for real-world investors. The aggregation algorithm, called a Company, maintains multiple Traders. By randomly generating new Traders and retraining them, Companies can efficiently find financially meaningful formulae whilst avoiding overfitting to a transient state of the market. We show the effectiveness of our method by conducting experiments on real market data.


Computer-Based Medical Consultations: MYCIN

AI Classics

This book has been adapted in large part from the author's doctoral thesis [Shortliffe, l 974b]. Portions of the work appeared previously in Computers And Biomedical Research [Shortliffe, 1973, l 975b], Mathematical Biosciences [Shortliffe, 1975a], and the Proceedings Of The Thirteenth San Diego Biomedical Symposium [Shortliffe, l 974a]. To Stanford's Medical Scientist Training Program, which is supported by the National Institutes of Health Contents


Towards Formula Translation using Recursive Neural Networks

arXiv.org Artificial Intelligence

While it has become common to perform automated translations on natural language, performing translations between different representations of mathematical formulae has thus far not been possible. We implemented the first translator for mathematical formulae based on recursive neural networks. We chose recursive neural networks because mathematical formulae inherently include a structural encoding. In our implementation, we developed new techniques and topologies for recursive tree-to-tree neural networks based on multi-variate multi-valued Long Short-Term Memory cells. We propose a novel approach for mini-batch training that utilizes clustering and tree traversal. We evaluate our translator and analyze the behavior of our proposed topologies and techniques based on a translation from generic LaTeX to the semantic LaTeX notation. We use the semantic LaTeX notation from the Digital Library for Mathematical Formulae and the Digital Repository for Mathematical Formulae at the National Institute for Standards and Technology. We find that a simple heuristics-based clustering algorithm outperforms the conventional clustering algorithms on the task of clustering binary trees of mathematical formulae with respect to their topology. Furthermore, we find a mask for the loss function, which can prevent the neural network from finding a local minimum of the loss function. Given our preliminary results, a complete translation from formula to formula is not yet possible. However, we achieved a prediction accuracy of 47.05% for predicting symbols at the correct position and an accuracy of 92.3% when ignoring the predicted position. Concluding, our work advances the field of recursive neural networks by improving the training speed and quality of training. In the future, we will work towards a complete translation allowing a machine-interpretation of LaTeX formulae.


Designing a GUI for Proofs - Evaluation of an HCI Experiment

arXiv.org Artificial Intelligence

Human-computer interaction (HCI) is the interdisciplinary study of interaction between people (users) and computers. Its main goal is making computers more user-friendly and easier to use. HCI is concerned with methodologies and processes for designing interfaces, with methods for implementing interfaces, with techniques for evaluating and comparing interfaces, with developing new interfaces and interaction techniques and with developing descriptive and predictive models and theories of interaction [9]. More often than not, user interfaces for theorem provers are developed as a mere add-on to the main proving engine. The result is an interaction design suitable for proof experts only.