AITopics | pikan

pikan

Which Optimizer Works Best for Physics-Informed Neural Networks and Kolmogorov-Arnold Networks?

Kiyani, Elham, Shukla, Khemraj, Urbán, Jorge F., Darbon, Jérôme, Karniadakis, George Em

arXiv.org Artificial Intelligence, Jan-22-2025

Physics-Informed Neural Networks (PINNs) have revolutionized the computation of PDE solutions by integrating partial differential equations (PDEs) into the neural network's training process as soft constraints, becoming an important component of the scientific machine learning (SciML) ecosystem. In their current implementation, PINNs are mainly optimized using first-order methods like Adam, as well as quasi-Newton methods such as BFGS and its low-memory variant, L-BFGS. However, these optimizers often struggle with highly non-linear and non-convex loss landscapes, leading to challenges such as slow convergence, local minima entrapment, and (non)degenerate saddle points. In this study, we investigate the performance of Self-Scaled Broyden (SSBroyden) methods and other advanced quasi-Newton schemes, including BFGS and L-BFGS with different line search strategies. These methods dynamically rescale updates based on historical gradient information, thus enhancing training efficiency and accuracy. We systematically compare these optimizers on key challenging linear, stiff, multi-scale and non-linear PDE benchmarks, including the Burgers, Allen-Cahn, Kuramoto-Sivashinsky, and Ginzburg-Landau equations, and extend our study to the Physics-Informed Kolmogorov-Arnold Network (PIKAN) representation. Our findings provide insights into the effectiveness of second-order optimization strategies in improving the convergence and accurate generalization of PINNs for complex PDEs by orders of magnitude compared to the state-of-the-art.
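To make the "soft constraint" idea and the role of a quasi-Newton optimizer concrete, here is a minimal sketch in plain Python. It is not the paper's setup: it assumes a toy ODE, u'(x) = -u(x) with u(0) = 1, in place of the PDE benchmarks, a cubic polynomial in place of a neural network, and a textbook BFGS with backtracking line search rather than SSBroyden. All function names (`trial`, `features`, `loss_and_grad`, `bfgs`) are hypothetical and chosen for illustration only.

```python
import math

# Collocation points on [0, 1] where the ODE residual is penalized.
XS = [i / 20 for i in range(21)]

def trial(c, x):
    """Cubic trial solution u(x); a stand-in for a neural network (assumption)."""
    return c[0] + c[1] * x + c[2] * x**2 + c[3] * x**3

def features(x):
    """Derivative of the residual r = u'(x) + u(x) with respect to each coefficient."""
    return [1.0, 1.0 + x, 2 * x + x**2, 3 * x**2 + x**3]

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def loss_and_grad(c):
    """Soft-constraint loss: mean squared ODE residual plus initial-condition penalty."""
    loss = (c[0] - 1.0) ** 2
    grad = [2.0 * (c[0] - 1.0), 0.0, 0.0, 0.0]
    for x in XS:
        phi = features(x)
        r = dot(c, phi)  # the residual is linear in the coefficients
        loss += r * r / len(XS)
        for k in range(4):
            grad[k] += 2.0 * r * phi[k] / len(XS)
    return loss, grad

def bfgs(c, iters=200, tol=1e-10):
    """Textbook BFGS with Armijo backtracking (not the SSBroyden variant)."""
    n = len(c)
    H = [[float(i == j) for j in range(n)] for i in range(n)]  # inverse-Hessian estimate
    f, g = loss_and_grad(c)
    for _ in range(iters):
        if math.sqrt(dot(g, g)) < tol:
            break
        p = [-dot(row, g) for row in H]  # quasi-Newton search direction
        t = 1.0
        while True:  # halve the step until the Armijo condition holds
            c_new = [ci + t * pi for ci, pi in zip(c, p)]
            f_new, g_new = loss_and_grad(c_new)
            if f_new <= f + 1e-4 * t * dot(g, p) or t < 1e-12:
                break
            t *= 0.5
        s = [a - b for a, b in zip(c_new, c)]
        y = [a - b for a, b in zip(g_new, g)]
        sy = dot(s, y)
        if sy > 1e-12 * math.sqrt(dot(s, s) * dot(y, y)):  # skip unsafe updates
            rho = 1.0 / sy
            Hy = [dot(row, y) for row in H]
            yHy = dot(y, Hy)
            for i in range(n):
                for j in range(n):
                    H[i][j] += ((1.0 + rho * yHy) * rho * s[i] * s[j]
                                - rho * (Hy[i] * s[j] + s[i] * Hy[j]))
        c, f, g = c_new, f_new, g_new
    return c, f

coeffs, final_loss = bfgs([0.0, 0.0, 0.0, 0.0])
print("u(1) =", trial(coeffs, 1.0), "exact =", math.exp(-1.0))
```

Because the trial function is linear in its coefficients, this loss is a convex quadratic, so it says nothing about the non-convex landscapes the abstract describes. It only illustrates how the equation residual and the initial condition enter a single loss, and how a quasi-Newton method builds curvature information from successive gradients.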

artificial intelligence, machine learning, ssbroyden, (19 more...)

2501.16371

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
Europe > Spain (0.04)
Europe > Portugal > Braga > Braga (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government (0.46)
Energy (0.46)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Physics Informed Kolmogorov-Arnold Neural Networks for Dynamical Analysis via Efficent-KAN and WAV-KAN

Patra, Subhajit, Panda, Sonali, Parida, Bikram Keshari, Arya, Mahima, Jacobs, Kurt, Bondar, Denys I., Sen, Abhijit

arXiv.org Artificial Intelligence, Jul-28-2024

However, traditional deep neural networks often face challenges in achieving high accuracy without incurring significant computational costs. In this work, we implement the Physics-Informed Kolmogorov-Arnold Neural Networks (PIKAN) through efficient-KAN and WAV-KAN, which utilize the Kolmogorov-Arnold representation theorem. PIKAN demonstrates superior performance compared to conventional deep neural networks, achieving the same level of accuracy with fewer layers and reduced computational overhead. We explore both B-spline and wavelet-based implementations of PIKAN and benchmark their performance across various ordinary and partial differential equations using unsupervised (data-free) and supervised (data-driven) techniques. For certain differential equations, the data-free approach suffices to find accurate solutions, while in more complex scenarios, the data-driven method enhances the PIKAN's ability to converge to the correct solution. We validate our results against numerical solutions and achieve 99% accuracy in most scenarios.

I. INTRODUCTION

The advent of deep learning and its use cases in solving complicated tasks related to computer vision, natural language processing, speech, etc., has led to state-of-the-art applications in industries like healthcare, finance, and robotics, to name a few. Further, using deep neural networks (DNNs) to solve differential equations through Physics Informed Neural Networks (PINNs) is another breakthrough that offered a new framework for solving partial differential equations [1]. Since then, the field of PINNs has received a lot of attention (e.g., see review [2]) and has been extended to solve fractional equations, integral-differential equations, and stochastic partial differential equations [3-5]. PINNs have been developed to be more robust and accurate [6] because the original form of PINN has drawbacks [7-12], which emanate from deep networks. Recently, a promising alternative to the traditional multilayer perceptron has been proposed: the Kolmogorov-Arnold Neural Network (KAN) [13].
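As a rough illustration of how a KAN differs from a multilayer perceptron, the sketch below places a learnable univariate function on every input-to-output edge and sums the results at each output node. It is a simplification under stated assumptions: it uses degree-1 B-splines (hat functions) on a uniform grid rather than the higher-order B-splines or wavelets of efficient-KAN and WAV-KAN, it includes no training loop, and the names `hat_basis` and `KANLayer` are hypothetical rather than taken from either library.

```python
def hat_basis(x, grid):
    """Degree-1 B-spline (hat) basis values at x on a uniform grid."""
    h = grid[1] - grid[0]
    return [max(0.0, 1.0 - abs(x - g) / h) for g in grid]

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

class KANLayer:
    """out_j = sum_i phi_ji(x_i), where each phi_ji is a learnable 1-D spline."""

    def __init__(self, n_in, n_out, grid):
        self.grid = grid
        # coef[j][i][k]: weight of basis function k on the edge from input i to output j
        self.coef = [[[0.0] * len(grid) for _ in range(n_in)] for _ in range(n_out)]

    def forward(self, x):
        basis = [hat_basis(xi, self.grid) for xi in x]  # one basis vector per input
        return [sum(dot(c_ji, b_i) for c_ji, b_i in zip(row, basis))
                for row in self.coef]

# Usage: hand-set the edge functions instead of training them.
grid = [i / 4 for i in range(5)]           # nodes 0, 0.25, 0.5, 0.75, 1
layer = KANLayer(n_in=2, n_out=1, grid=grid)
layer.coef[0][0] = [g ** 2 for g in grid]  # phi_00 interpolates x^2 at the nodes
layer.coef[0][1] = [2.0 * g for g in grid] # phi_01 represents 2x exactly
out = layer.forward([0.5, 1.0])
print(out)
```

In a trained KAN the spline coefficients would be the optimized parameters, and a physics-informed variant would feed the layer outputs into a residual-based loss in the same way as a PINN.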

differential equation, equation, pikan, (14 more...)

2407.18373

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
