AITopics | Deng, Yangtao

Deng, Yangtao

Minder: Faulty Machine Detection for Large-scale Distributed Model Training

Deng, Yangtao, Shi, Xiang, Jiang, Zhuo, Zhang, Xingjian, Zhang, Lei, Zhang, Zhang, Li, Bo, Song, Zuquan, Zhu, Hang, Liu, Gaohong, Li, Fuliang, Wang, Shuguang, Lin, Haibin, Ye, Jianxi, Yu, Minlan

arXiv.org Artificial Intelligence, Nov-3-2024

Large-scale distributed model training requires simultaneous training on up to thousands of machines. When an unexpected fault occurs in one machine, detecting the faulty machine quickly is critical. From our experience, a training task can encounter two faults per day on average, possibly leading to a halt for hours. To address the drawbacks of time-consuming and labor-intensive manual scrutiny, we propose Minder, an automatic faulty machine detector for distributed training tasks. The key idea of Minder is to automatically and efficiently detect the distinctive monitoring-metric patterns of a faulty machine, which can persist for a period before the entire training task comes to a halt. Minder has been deployed in our production environment for over one year, monitoring daily distributed training tasks that each involve up to thousands of machines. In our real-world fault detection scenarios, Minder reacts to faults accurately and efficiently, within 3.6 seconds on average, with a precision of 0.904 and an F1-score of 0.893.
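The abstract reports precision and F1-score but not recall. Since F1 is the harmonic mean of precision and recall, the implied recall can be recovered from the two reported figures. The following is a minimal sketch of that arithmetic; the function name is hypothetical, and the result is only as exact as the rounded values 0.904 and 0.893 allow.

```python
def recall_from_precision_f1(precision: float, f1: float) -> float:
    """Solve F1 = 2*P*R / (P + R) for R, giving R = F1*P / (2*P - F1)."""
    denominator = 2.0 * precision - f1
    if denominator <= 0.0:
        # No valid recall exists when F1 >= 2 * precision.
        raise ValueError("F1 must be smaller than 2 * precision")
    return f1 * precision / denominator


# Figures quoted in the abstract (rounded to three decimals there).
precision = 0.904
f1 = 0.893

recall = recall_from_precision_f1(precision, f1)
print(round(recall, 3))  # 0.882
```

Under these reported values, the implied recall is about 0.882, slightly below the precision.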

data mining, large language model, machine learning

2411.01791

Genre: Research Report (0.64)

Industry:

Health & Medicine (0.92)
Information Technology (0.70)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications > Networks (0.93)

Moving Sampling Physics-informed Neural Networks induced by Moving Mesh PDE

Yang, Yu, Yang, Qihong, Deng, Yangtao, He, Qiaolin

arXiv.org Artificial Intelligence, Jan-15-2024

Many researchers have proposed deep learning solvers based on deep neural networks that are now widely used, such as the Deep Ritz method [29], which solves the variational problems arising from PDEs; the Deep BSDE model [4], which is developed from stochastic differential equations and performs well on high-dimensional problems; and the DeepONet framework [12], which learns operators accurately and efficiently from a relatively small dataset. In this article, we use physics-informed neural networks (PINN) [17]. In PINN, the governing equations of the PDEs, the boundary conditions, and related physical constraints are incorporated into the design of the loss function, and an optimization algorithm finds the network parameters that minimize this loss, so that the approximate solution output by the neural network satisfies the governing equations and constraints.
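As an illustration of the loss design described above, a commonly used generic form of the PINN loss can be written as follows. All symbols here are assumptions introduced for illustration and do not come from the listed paper: a PDE $\mathcal{N}[u] = f$ on a domain $\Omega$, a boundary condition $\mathcal{B}[u] = g$ on $\partial\Omega$, a network $u_\theta$ with parameters $\theta$, interior collocation points $x_i^r$, boundary points $x_j^b$, and a weight $\lambda > 0$.

```latex
\mathcal{L}(\theta)
  = \frac{1}{N_r} \sum_{i=1}^{N_r}
      \bigl| \mathcal{N}[u_\theta](x_i^r) - f(x_i^r) \bigr|^2
  + \lambda \, \frac{1}{N_b} \sum_{j=1}^{N_b}
      \bigl| \mathcal{B}[u_\theta](x_j^b) - g(x_j^b) \bigr|^2,
\qquad
\theta^\ast = \arg\min_{\theta} \mathcal{L}(\theta).
```

The first term penalizes the residual of the governing equation at the interior points, and the second penalizes violation of the boundary condition. How the interior points $x_i^r$ are chosen is a sampling decision that this generic form leaves open.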

artificial intelligence, deep learning, machine learning

2311.16167

Country: Asia > China (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Networks Based on Power Method and Inverse Power Method for Solving Linear Eigenvalue Problems

Yang, Qihong, Deng, Yangtao, Yang, Yu, He, Qiaolin, Zhang, Shiquan

arXiv.org Artificial Intelligence, Jul-15-2023

In this article, we propose two kinds of neural networks, inspired by the power method and the inverse power method, to solve linear eigenvalue problems. These neural networks share similar ideas with the traditional methods, with the differential operator realized by automatic differentiation. The eigenfunction of the eigenvalue problem is learned by the neural network, and the iterative algorithms are implemented by optimizing a specially defined loss function. The largest positive eigenvalue, the smallest eigenvalue, and interior eigenvalues with given prior knowledge can be solved efficiently. We examine the applicability and accuracy of our methods in numerical experiments in one dimension, two dimensions, and higher dimensions. Numerical results show that our methods yield accurate eigenvalue and eigenfunction approximations.
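For readers unfamiliar with the classical iterations that inspire these networks, the sketch below shows the traditional power method and inverse power method on a small symmetric matrix. This is the textbook matrix version, not the neural network method of the paper; the matrix, function names, and iteration count are illustrative assumptions.

```python
import math


def mat_vec(a, x):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def normalize(x):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(sum(v * v for v in x))
    return [v / norm for v in x]


def rayleigh_quotient(a, x):
    """Eigenvalue estimate x^T A x for a unit vector x."""
    return sum(xi * axi for xi, axi in zip(x, mat_vec(a, x)))


def solve(a, b):
    """Solve A y = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    y = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * y[c] for c in range(r + 1, n))
        y[r] = (m[r][n] - s) / m[r][r]
    return y


def power_method(a, iters=200):
    """Repeatedly apply A; converges to the dominant eigenpair."""
    x = normalize([1.0] * len(a))
    for _ in range(iters):
        x = normalize(mat_vec(a, x))
    return rayleigh_quotient(a, x), x


def inverse_power_method(a, iters=200):
    """Repeatedly apply A^{-1}; converges to the smallest-magnitude eigenpair."""
    x = normalize([1.0] * len(a))
    for _ in range(iters):
        x = normalize(solve(a, x))
    return rayleigh_quotient(a, x), x


# Illustrative symmetric matrix with eigenvalues (5 +/- sqrt(5)) / 2.
A = [[2.0, 1.0], [1.0, 3.0]]
largest, _ = power_method(A)
smallest, _ = inverse_power_method(A)
print(largest, smallest)
```

In the setting of the abstract, the matrix is replaced by a differential operator evaluated through automatic differentiation, and the eigenvector is replaced by a neural network approximating the eigenfunction.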

artificial intelligence, eigenfunction, machine learning

2209.11134

Genre: Research Report > New Finding (0.66)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

On the uncertainty analysis of the data-enabled physics-informed neural network for solving neutron diffusion eigenvalue problem

Yang, Yu, Gong, Helin, Yang, Qihong, Deng, Yangtao, He, Qiaolin, Zhang, Shiquan

arXiv.org Artificial Intelligence, Mar-17-2023

In practical engineering experiments, the data obtained through detectors are inevitably noisy. For the previously proposed data-enabled physics-informed neural network (DEPINN), we investigate its performance in solving the neutron diffusion eigenvalue problem from several perspectives when the prior data contain different scales of noise. Further, to reduce the effect of noise and make better use of the noisy prior data, we propose innovative interval loss functions and give some rigorous mathematical proofs. The robustness of DEPINN is examined on two typical benchmark problems through a large number of numerical results, and the effectiveness of the proposed interval loss function is demonstrated by comparison. This paper confirms the feasibility of the improved DEPINN for practical engineering applications in nuclear reactor physics.

data-enabled physics-informed neural network, neutron diffusion eigenvalue problem, uncertainty analysis

2303.08455

Genre: Research Report (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)
