
Collaborating Authors

Cheon, Gowoon


CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning

arXiv.org Artificial Intelligence

Scientific problem-solving involves synthesizing information while applying expert knowledge. We introduce CURIE, a scientific long-Context Understanding, Reasoning, and Information Extraction benchmark that measures the potential of Large Language Models (LLMs) for scientific problem-solving and for assisting scientists in realistic workflows. The benchmark comprises ten challenging tasks with a total of 580 problem-and-solution pairs curated by experts in six disciplines - materials science, condensed matter physics, quantum computing, geospatial analysis, biodiversity, and proteins - covering both experimental and theoretical workflows in science. We evaluate a range of closed and open LLMs on the CURIE tasks, which require domain expertise, comprehension of long in-context information, and multi-step reasoning. While Gemini Flash 2.0 and Claude-3 show consistently high comprehension across domains, the popular GPT-4o and Command R+ fail dramatically on protein sequencing tasks. With the best performance at 32%, there is much room for improvement for all models. We hope that insights gained from CURIE can guide the future development of LLMs in the sciences. Evaluation code and data are at https://github.com/google/curie
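
Since CURIE ships evaluation code, it may help to see the general shape of scoring a model on such a benchmark. The following is a minimal sketch, not the repository's actual API: the JSON schema, the field names, and the `model_fn` callable are assumptions for illustration, and exact-match scoring stands in for the task-specific metrics a benchmark like this would need.

```python
# A minimal sketch of scoring a model on long-context problem/solution pairs.
# The file layout, field names, and `model_fn` are illustrative assumptions;
# the real schema and metrics live in the CURIE repository.
import json
from typing import Callable


def evaluate(dataset_path: str, model_fn: Callable[[str], str]) -> float:
    """Return the fraction of problems whose prediction matches the reference."""
    with open(dataset_path) as f:
        examples = json.load(f)  # assumed: list of {"context", "question", "answer"}
    correct = 0
    for ex in examples:
        # Long in-context material is prepended to the question as one prompt.
        prompt = f"{ex['context']}\n\nQuestion: {ex['question']}"
        prediction = model_fn(prompt)
        # Exact match is a stand-in; extraction tasks would use structured scoring.
        correct += int(prediction.strip() == ex["answer"].strip())
    return correct / len(examples)


if __name__ == "__main__":
    # A stub model that always returns the same string, to show the call shape.
    print(evaluate("curie_task.json", lambda prompt: "unknown"))
```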


Dataset of Random Relaxations for Crystal Structure Search of Li-Si System

arXiv.org Artificial Intelligence

Crystal structure search is a long-standing challenge in materials design. We present a dataset of more than 100,000 structural relaxations of potential battery anode materials, starting from randomized structures and computed with density functional theory. We illustrate the use of the dataset by training graph neural networks to predict structural relaxations from randomly generated structures. Our models directly predict stresses in addition to forces, which allows them to accurately simulate relaxations of both ionic positions and lattice vectors. We show that models trained on molecular dynamics simulations fail to simulate relaxations from random structures, while training on our data reduces the error on the same task by up to two orders of magnitude. Our model is able to find an experimentally verified structure of a stoichiometry held out from training. We find that randomly perturbing atomic positions during training improves both the accuracy and the out-of-domain generalization of the models.
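
The last point, perturbing atomic positions during training, is a simple data-augmentation step. Below is a minimal sketch assuming structures are held as NumPy arrays of Cartesian coordinates; the noise scale and array layout are illustrative choices, not the paper's exact recipe.

```python
# A minimal sketch of position-perturbation augmentation. The (n_atoms, 3)
# layout and the default noise scale are illustrative assumptions.
from typing import Optional

import numpy as np


def perturb_positions(positions: np.ndarray, sigma: float = 0.05,
                      rng: Optional[np.random.Generator] = None) -> np.ndarray:
    """Add isotropic Gaussian noise (e.g., in Angstroms) to atomic positions.

    positions: (n_atoms, 3) Cartesian coordinates.
    sigma: standard deviation of the displacement noise.
    """
    rng = rng or np.random.default_rng()
    return positions + rng.normal(scale=sigma, size=positions.shape)


# Usage inside a training loop: perturb each sampled structure before the
# GNN forward pass so the model also sees geometries off the relaxation path.
rng = np.random.default_rng(0)
pos = np.zeros((4, 3))                       # toy 4-atom structure
noisy = perturb_positions(pos, sigma=0.05, rng=rng)
```

The intuition is that relaxation trajectories alone cover a narrow slice of configuration space, so injecting small random displacements exposes the model to the higher-force geometries it will encounter when relaxing random structures.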