AITopics | Materials

Collaborating Authors

Materials

Optimized Architectures for Kolmogorov-Arnold Networks

arXiv.org Machine LearningDec-16-2025

Efforts to improve Kolmogorov-Arnold networks (KANs) with architectural enhancements have been stymied by the complexity those enhancements bring, undermining the interpretability that makes KANs attractive in the first place. Here we study overprovisioned architectures combined with sparsification to learn compact, interpretable KANs without sacrificing accuracy. Crucially, we focus on differentiable sparsification, turning architecture search into an end-to-end optimization problem. Across function approximation benchmarks, dynamical systems forecasting, and real-world prediction tasks, we demonstrate competitive or superior accuracy while discovering substantially smaller models. Overprovisioning and sparsification are synergistic, with the combination outperforming either alone. The result is a principled path toward models that are both more expressive and more interpretable, addressing a key tension in scientific machine learning.

activation function, architecture, sparsification, (14 more...)

arXiv.org Machine Learning

2512.12448

Country:

North America > United States > Vermont > Chittenden County > Burlington (0.14)
Europe > Russia (0.04)
Asia > Russia (0.04)
Asia > Japan (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Materials (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables

Zhou, Yitong, Cheng, Mingyue, Mao, Qingyang, Luo, Yucong, Liu, Qi, Li, Yupeng, Zhang, Xiaohan, Liu, Deguang, Li, Xin, Chen, Enhong

arXiv.org Artificial IntelligenceDec-12-2025

With the widespread application of multimodal large language models in scientific intelligence, there is an urgent need for more challenging evaluation benchmarks to assess their ability to understand complex scientific data. Scientific tables, as core carriers of knowledge representation, combine text, symbols, and graphics, forming a typical multimodal reasoning scenario. However, existing benchmarks are mostly focused on general domains, failing to reflect the unique structural complexity and domain-specific semantics inherent in scientific research. Chemical tables are particularly representative: they intertwine structured variables such as reagents, conditions, and yields with visual symbols like molecular structures and chemical formulas, posing significant challenges to models in cross-modal alignment and semantic parsing. To address this, we propose ChemTable-a large scale benchmark of chemical tables constructed from real-world literature, containing expert-annotated cell layouts, logical structures, and domain-specific labels. It supports two core tasks: (1) table recognition (structure and content extraction); and (2) table understanding (descriptive and reasoning-based question answering). Evaluation on ChemTable shows that while mainstream multimodal models perform reasonably well in layout parsing, they still face significant limitations when handling critical elements such as molecular structures and symbolic conventions. Closed-source models lead overall but still fall short of human-level performance. This work provides a realistic testing platform for evaluating scientific multimodal understanding, revealing the current bottlenecks in domain-specific reasoning and advancing the development of intelligent systems for scientific research.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.11375

Country:

Asia > China > Anhui Province > Hefei (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Russia (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Materials > Chemicals (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

DeepMech: A Machine Learning Framework for Chemical Reaction Mechanism Prediction

Das, Manajit, Hoque, Ajnabiul, Baranwal, Mayank, Sunoj, Raghavan B.

arXiv.org Artificial IntelligenceDec-11-2025

Prediction of complete step-by-step chemical reaction mechanisms (CRMs) remains a major challenge. Whereas the traditional approaches in CRM tasks rely on expert-driven experiments or costly quantum chemical computations, contemporary deep learning (DL) alternatives ignore key intermediates and mechanistic steps and often suffer from hallucinations. We present DeepMech, an interpretable graph-based DL framework employing atom- and bond-level attention, guided by generalized templates of mechanistic operations (TMOps), to generate CRMs. Trained on our curated ReactMech dataset (~30K CRMs with 100K atom-mapped and mass-balanced elementary steps), DeepMech achieves 98.98+/-0.12% accuracy in predicting elementary steps and 95.94+/-0.21% in complete CRM tasks, besides maintaining high fidelity even in out-of-distribution scenarios as well as in predicting side and/or byproducts. Extension to multistep CRMs relevant to prebiotic chemistry, demonstrates the ability of DeepMech in effectively reconstructing 2 pathways from simple primordial substrates to complex biomolecules such as serine and aldopentose. Attention analysis identifies reactive atoms/bonds in line with chemical intuition, rendering our model interpretable and suitable for reaction design.

artificial intelligence, machine learning, reaction, (18 more...)

arXiv.org Artificial Intelligence

2509.15872

Country:

Asia > India > Maharashtra > Mumbai (0.05)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Workflow (0.93)
Research Report > New Finding (0.67)

Industry:

Materials > Chemicals > Commodity Chemicals > Petrochemicals (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation

Li, Yuyang, Chen, Yinghan, Zhao, Zihang, Li, Puhao, Liu, Tengyu, Huang, Siyuan, Zhu, Yixin

arXiv.org Artificial IntelligenceDec-11-2025

Robotic manipulation requires both rich multimodal perception and effective learning frameworks to handle complex real-world tasks. See-through-skin (STS) sensors, which combine tactile and visual perception, offer promising sensing capabilities, while modern imitation learning provides powerful tools for policy acquisition. However, existing STS designs lack simultaneous multimodal perception and suffer from unreliable tactile tracking. Furthermore, integrating these rich multimodal signals into learning-based manipulation pipelines remains an open challenge. We introduce TacThru, an STS sensor enabling simultaneous visual perception and robust tactile signal extraction, and TacThru-UMI, an imitation learning framework that leverages these multimodal signals for manipulation. Our sensor features a fully transparent elastomer, persistent illumination, novel keyline markers, and efficient tracking, while our learning system integrates these signals through a Transformer-based Diffusion Policy. Experiments on five challenging real-world tasks show that TacThru-UMI achieves an average success rate of 85.5%, significantly outperforming the baselines of alternating tactile-visual (66.3%) and vision-only (55.4%). The system excels in critical scenarios, including contact detection with thin and soft objects and precision manipulation requiring multimodal coordination. This work demonstrates that combining simultaneous multimodal perception with modern learning frameworks enables more precise, adaptable robotic manipulation.

artificial intelligence, machine learning, manipulation, (17 more...)

arXiv.org Artificial Intelligence

2512.09851

Country: Asia > China (0.28)

Genre: Research Report (0.64)

Industry: Materials (0.73)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Predicting Polymer Solubility in Solvents Using SMILES Strings

Reinhard, Andrew

arXiv.org Artificial IntelligenceDec-11-2025

Understanding and predicting polymer solubility in various solvents is critical for applications ranging from recycling to pharmaceutical formulation. This work presents a deep learning framework that predicts polymer solubility, expressed as weight percent (wt%), directly from SMILES representations of both polymers and solvents. A dataset of 8,049 polymer solvent pairs at 25 deg C was constructed from calibrated molecular dynamics simulations (Zhou et al., 2023), and molecular descriptors and fingerprints were combined into a 2,394 feature representation per sample. A fully connected neural network with six hidden layers was trained using the Adam optimizer and evaluated using mean squared error loss, achieving strong agreement between predicted and actual solubility values. Generalizability was demonstrated using experimentally measured data from the Materials Genome Project, where the model maintained high accuracy on 25 unseen polymer solvent combinations. These findings highlight the viability of SMILES based machine learning models for scalable solubility prediction and high-throughput solvent screening, supporting applications in green chemistry, polymer processing, and materials design.

artificial intelligence, machine learning, polymer, (17 more...)

arXiv.org Artificial Intelligence

2512.09784

Genre: Research Report (0.82)

Industry: Materials > Chemicals > Commodity Chemicals > Petrochemicals > Polymers & Plastics (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Circuits, Features, and Heuristics in Molecular Transformers

Varadi, Kristof, Marosi, Mark, Antal, Peter

arXiv.org Artificial IntelligenceDec-11-2025

Transformers generate valid and diverse chemical structures, but little is known about the mechanisms that enable these models to capture the rules of molecular representation. We present a mechanistic analysis of autoregressive transformers trained on drug-like small molecules to reveal the computational structure underlying their capabilities across multiple levels of abstraction. We identify computational patterns consistent with low-level syntactic parsing and more abstract chemical validity constraints. Using sparse autoencoders (SAEs), we extract feature dictionaries associated with chemically relevant activation patterns. We validate our findings on downstream tasks and find that mechanistic insights can translate to predictive performance in various practical settings.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2512.09757

Country: Europe > Hungary (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Materials > Chemicals > Commodity Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

On Mobile Ad Hoc Networks for Coverage of Partially Observable Worlds

Meriaux, Edwin, Wen, Shuo, Langevin, Louis-Roy, Precup, Doina, Loría, Antonio, Dudek, Gregory

arXiv.org Artificial IntelligenceDec-11-2025

This paper addresses the movement and placement of mobile agents to establish a communication network in initially unknown environments. We cast the problem in a computational-geometric framework by relating the coverage problem and line-of-sight constraints to the Cooperative Guard Art Gallery Problem, and introduce its partially observable variant, the Partially Observable Cooperative Guard Art Gallery Problem (POCGAGP). We then present two algorithms that solve POCGAGP: CADENCE, a centralized planner that incrementally selects 270 degree corners at which to deploy agents, and DADENCE, a decentralized scheme that coordinates agents using local information and lightweight messaging. Both approaches operate under partial observability and target simultaneous coverage and connectivity. We evaluate the methods in simulation across 1,500 test cases of varied size and structure, demonstrating consistent success in forming connected networks while covering and exploring unknown space. These results highlight the value of geometric abstractions for communication-driven exploration and show that decentralized policies are competitive with centralized performance while retaining scalability.

agent, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2512.09495

Country: Europe (0.28)

Genre: Research Report > New Finding (0.92)

Industry: Materials (0.92)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)

Add feedback

A Granular Framework for Construction Material Price Forecasting: Econometric and Machine-Learning Approaches

Lyu, Boge, Yin, Qianye, Tommelein, Iris Denise, Liu, Hanyang, Ranka, Karnamohit, Yeluripati, Karthik, Shi, Junzhe

arXiv.org Artificial IntelligenceDec-11-2025

This study develops a forecasting framework t hat leverages the Construction Specifications Institute (CSI) MasterFormat as the target data structure, enabling predictions at the six - digit section level and supporting detailed cost projections across a wide spectrum of building materials. To enhance p redictive accuracy, the framework integrates explanatory variables such as raw material prices, commodity indexes, and macroeconomic indicators. Four time - series models, Long Short - Term Memory (LSTM), Autoregressive Integrated Moving Average (ARIMA), Vecto r Error Correction Model (VECM), and Chronos - Bolt, were evaluated under both baseline configurations (using CSI data only) and extended versions with explanatory variables. Results demonstrate that incorporating explanatory variables significantly improves predictive performance across all models. Among the tested approaches, the LSTM model consistently ach ieved the highest accuracy, with RMSE values as low as 1.390 and MAPE values of 0.957, representing improvements of up to 59 % over traditional statistical time - series model, ARIMA. Validation across multiple CSI divisions confirmed the framework's scalability, while Division 06 (Wood, Plastics, and Composites) is presented in detail as a demonstration case. This research offers a robust methodology that enables owners and contractors to improve budgeting practices and achieve more reliable cost estimation at the Definitive level. INTRODUCTION 1.1 Motivation The construction industry continues to demonstrate steady long - term growth, with global activity projected to reach US$9.8 trillion by 2026 [1] . Major upcoming programs in the United States, such as the Los Angeles 2028 Olympics and TSMC's fabrication facility in Arizona [2] [3], highlight the scale of high - value projects in the near future. However, volatility in construction material prices has emerged as a critical challenge, creating significant uncertainty for contractors in project planning, budgeting, and cost management. Price fluctuations, driven by raw material costs, macroeconomic conditions such as inflation and interest rates, and supply - demand imbalances, have amplified risks of cost overruns and delays [4] [5] [6] [7] [8] . Traditional econometric methods (i.e.,multiple regression analysis) and modern econometric methods (i.e., univariate, and multivariate time series methods) have faced limitations in effectively capturing the high - frequency volatility observed in constructi on material prices [9] . These models often struggle to handle the complexity of input data and exhibit limited predictive accuracy in real - world applications.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2512.0936

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.47)
North America > United States > California > Alameda County (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Materials > Construction Materials (1.00)
Construction & Engineering (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AI-Driven Expansion and Application of the Alexandria Database

Cavignac, Théo, Schmidt, Jonathan, De Breuck, Pierre-Paul, Loew, Antoine, Cerqueira, Tiago F. T., Wang, Hai-Chen, Bochkarev, Anton, Lysogorskiy, Yury, Romero, Aldo H., Drautz, Ralf, Botti, Silvana, Marques, Miguel A. L.

arXiv.org Artificial IntelligenceDec-11-2025

We present a novel multi-stage workflow for computational materials discovery that achieves a 99% success rate in identifying compounds within 100 meV/atom of thermodynamic stability, with a threefold improvement over previous approaches. By combining the Matra-Genoa generative model, Orb-v2 universal machine learning interatomic potential, and ALIGNN graph neural network for energy prediction, we generated 119 million candidate structures and added 1.3 million DFT-validated compounds to the ALEXANDRIA database, including 74 thousand new stable materials. The expanded ALEXANDRIA database now contains 5.8 million structures with 175 thousand compounds on the convex hull. Predicted structural disorder rates (37-43%) match experimental databases, unlike other recent AI-generated datasets. Analysis reveals fundamental patterns in space group distributions, coordination environments, and phase stability networks, including sub-linear scaling of convex hull connectivity. We release the complete dataset, including sAlex25 with 14 million out-of-equilibrium structures containing forces and stresses for training universal force fields. We demonstrate that fine-tuning a GRACE model on this data improves benchmark accuracy. All data, models, and workflows are freely available under Creative Commons licenses.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2512.09169

Country:

Europe (1.00)
North America > United States (0.93)

Genre:

Workflow (1.00)
Research Report (1.00)

Industry:

Materials > Chemicals (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Partial Inverse Design of High-Performance Concrete Using Cooperative Neural Networks for Constraint-Aware Mix Generation

Nugraha, Agung, Im, Heungjun, Lee, Jihwan

arXiv.org Artificial IntelligenceDec-11-2025

High-performance concrete requires complex mix design decisions involving interdependent variables and practical constraints. While data-driven methods have improved predictive modeling for forward design in concrete engineering, inverse design remains limited, especially when some variables are fixed and only the remaining ones must be inferred. This study proposes a cooperative neural network framework for the partial inverse design of high-performance concrete. The framework integrates an imputation model with a surrogate strength predictor and learns through cooperative training. Once trained, it generates valid and performance-consistent mix designs in a single forward pass without retraining for different constraint scenarios. Compared with baseline models, including autoencoder models and Bayesian inference with Gaussian process surrogates, the proposed method achieves R-squared values of 0.87 to 0.92 and substantially reduces mean squared error by approximately 50% and 70%, respectively. The results show that the framework provides an accurate and computationally efficient foundation for constraint-aware, data-driven mix proportioning.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2512.06813

Genre: Research Report > New Finding (0.88)

Industry:

Materials > Construction Materials (1.00)
Construction & Engineering (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback