AITopics | pyscf ipu

Collaborating Authors

pyscf ipu

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

1 Datasheet for QM1B

Neural Information Processing SystemsFeb-16-2026, 12:09:50 GMT

As recommended by the NeurIPS dataset and benchmark track, we documented QM1B and intended uses through the Datasheets for Datasets framework [1]. The goal of dataset datasheets as outlined by [1] is to provide a standardized process for documentating datasets. The authors of [1] present a list of carefully selected questions which dataset authors should answer. We hope our answers to these questions will facilitate better communication between us (the dataset creators) and future users of QM1B. For what purpose was the dataset created? Prior gaussian-based Density Functional Theory (DFT) datasets contained fewer than 20 million training examples.

artificial intelligence, inductive learning, machine learning, (19 more...)

Neural Information Processing Systems

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)

Add feedback

ac7f98dd0b342edaf3be79844a180a6b-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-16-2026, 12:09:47 GMT

artificial intelligence, dataset, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Connecticut > New Haven County > Wallingford (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom (0.04)

Industry: Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)

Add feedback

1 Datasheet for QM1B

Neural Information Processing SystemsOct-9-2025, 04:33:08 GMT

artificial intelligence, inductive learning, machine learning, (19 more...)

Neural Information Processing Systems

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)

Add feedback

Generating QM1B with PySCF

Neural Information Processing SystemsOct-9-2025, 04:33:05 GMT

Processing have resulted in immense progress on downstream tasks.

artificial intelligence, dataset, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Connecticut > New Haven County > Wallingford (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom (0.04)

Industry: Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)

Add feedback

Generating QM1B with PySCF$_{\text{IPU}}$

Mathiasen, Alexander, Helal, Hatem, Klaser, Kerstin, Balanca, Paul, Dean, Josef, Luschi, Carlo, Beaini, Dominique, Fitzgibbon, Andrew, Masters, Dominic

arXiv.org Artificial IntelligenceNov-2-2023

The emergence of foundation models in Computer Vision and Natural Language Processing have resulted in immense progress on downstream tasks. This progress was enabled by datasets with billions of training examples. Similar benefits are yet to be unlocked for quantum chemistry, where the potential of deep learning is constrained by comparatively small datasets with 100k to 20M training examples. These datasets are limited in size because the labels are computed using the accurate (but computationally demanding) predictions of Density Functional Theory (DFT). Notably, prior DFT datasets were created using CPU supercomputers without leveraging hardware acceleration. In this paper, we take a first step towards utilising hardware accelerators by introducing the data generator PySCF$_{\text{IPU}}$ using Intelligence Processing Units (IPUs). This allowed us to create the dataset QM1B with one billion training examples containing 9-11 heavy atoms. We demonstrate that a simple baseline neural network (SchNet 9M) improves its performance by simply increasing the amount of training data without additional inductive biases. To encourage future researchers to use QM1B responsibly, we highlight several limitations of QM1B and emphasise the low-resolution of our DFT options, which also serves as motivation for even larger, more accurate datasets. Code and dataset are available on Github: http://github.com/graphcore-research/pyscf-ipu

dataset, neural network, pyscf ipu, (12 more...)

arXiv.org Artificial Intelligence

2311.01135

Country:

North America > United States > Connecticut > New Haven County > Wallingford (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback