GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs
Jiao, Yuling, Li, Di, Lu, Xiliang, Yang, Jerry Zhijian, Yuan, Cheng
While both DRM and WAN use variational forms of the PDEs, PINNs solve PDEs by directly minimizing the squared residuals, which makes the method more flexible and easier to formulate for general problems. By reformulating the original PDE problem as an optimization problem, physics-informed neural networks [1, 4, 5, 6] can approximate the solution of a PDE within a deep neural network (DNN) function space by minimizing the corresponding loss term, defined as the integral of the residuals. In practice, this integral is approximated by Monte Carlo (MC) methods with finitely many points, which are usually sampled from a uniform distribution on the computational domain. In contrast to classical computational methods, where the main concern is the approximation error, neural network solvers must balance the approximation error and the generalization error: the approximation error mainly originates from the modeling capacity of the neural network, while the generalization error is mainly related to the discretization of the loss with random samples. While a uniform random sampling strategy is simple to implement, it may not pay sufficient attention to the low-regularity regions of the solution, which makes PINNs inefficient and even inaccurate, especially when the problem is high dimensional.
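For concreteness, writing the PDE as $\mathcal{N}[u](x)=0$ on a domain $\Omega$ (with boundary and initial terms handled analogously), the loss discussed above is, in schematic form,
\[
\mathcal{L}(\theta)=\int_{\Omega}\big|\mathcal{N}[u_\theta](x)\big|^{2}\,dx\;\approx\;\frac{|\Omega|}{N}\sum_{i=1}^{N}\big|\mathcal{N}[u_\theta](x_i)\big|^{2},\qquad x_i\overset{\text{i.i.d.}}{\sim}\mathrm{Unif}(\Omega),
\]
where $u_\theta$ denotes the DNN approximation. The GAS method of the paper replaces the uniform law of the samples $x_i$ with an adaptively updated Gaussian mixture distribution.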
Convergence Analysis of the Deep Galerkin Method for Weak Solutions
Jiao, Yuling, Lai, Yanming, Wang, Yang, Yang, Haizhao, Yang, Yunfei
This paper analyzes the convergence rate of a deep Galerkin method for the weak solution (DGMW) of second-order elliptic partial differential equations on $\mathbb{R}^d$ with Dirichlet, Neumann, and Robin boundary conditions, respectively. In DGMW, a deep neural network is applied to parametrize the PDE solution, and a second neural network is adopted to parametrize the test function in the traditional Galerkin formulation. By properly choosing the depth and width of these two networks in terms of the number of training samples $n$, it is shown that the convergence rate of DGMW is $\mathcal{O}(n^{-1/d})$, which is the first convergence result for weak solutions. The main idea of the proof is to divide the error of the DGMW into an approximation error and a statistical error. We derive an upper bound on the approximation error in the $H^{1}$ norm and bound the statistical error via Rademacher complexity.
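As a schematic illustration (the paper treats general second-order elliptic operators with Dirichlet, Neumann, and Robin boundary conditions, and its exact normalization of the test network may differ): for the model problem $-\Delta u=f$ with homogeneous Dirichlet data, a weak solution satisfies
\[
a(u,v):=\int_{\Omega}\nabla u\cdot\nabla v\,dx-\int_{\Omega}f\,v\,dx=0\qquad\text{for all admissible test functions } v.
\]
DGMW parametrizes the solution by a network $u_\theta$ and the test function by a second network $v_\phi$ and trains them adversarially on this identity, e.g. through an objective of the form $\min_\theta\max_\phi |a(u_\theta,v_\phi)|^{2}/\|v_\phi\|^{2}$, with the integrals discretized by the $n$ training samples.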
Estimation of Non-Crossing Quantile Regression Process with Deep ReQU Neural Networks
Shen, Guohao, Jiao, Yuling, Lin, Yuanyuan, Horowitz, Joel L., Huang, Jian
We propose a penalized nonparametric approach to estimating the quantile regression process (QRP) in a nonseparable model using rectified quadratic unit (ReQU) activated deep neural networks and introduce a novel penalty function to enforce non-crossing of quantile regression curves. We establish non-asymptotic excess risk bounds for the estimated QRP and derive the mean integrated squared error of the estimated QRP under mild smoothness and regularity conditions. To establish these non-asymptotic risk and estimation error bounds, we also develop a new error bound for approximating $C^s$ smooth functions with $s>0$, together with their derivatives, by ReQU activated neural networks. This approximation result for ReQU networks is new, of independent interest, and may be useful in other problems. Our numerical experiments demonstrate that the proposed method is competitive with or outperforms two existing methods for nonparametric quantile regression, based on reproducing kernels and random forests, respectively.
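Two of the ingredients above have standard closed forms: the ReQU activation and the level-$\tau$ quantile (check) loss,
\[
\sigma(x)=\max(x,0)^{2},\qquad \rho_\tau(u)=u\,\big(\tau-\mathbf{1}\{u<0\}\big),\quad \tau\in(0,1).
\]
The novel penalty is designed so that the estimated quantile function $\hat f(x,\tau)$ is nondecreasing in $\tau$ (hence non-crossing); its precise form is given in the paper.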
Approximation bounds for norm constrained neural networks with applications to regression and GANs
Jiao, Yuling, Wang, Yang, Yang, Yunfei
This paper studies the approximation capacity of ReLU neural networks with norm constraints on the weights. We prove upper and lower bounds on the approximation error of these networks for smooth function classes. The lower bound is derived through the Rademacher complexity of neural networks, which may be of independent interest. We apply these approximation bounds to analyze the convergence of regression using norm constrained neural networks and of distribution estimation by GANs. In particular, we obtain convergence rates for over-parameterized neural networks. It is also shown that GANs can achieve the optimal rate of learning probability distributions when the discriminator is a properly chosen norm constrained neural network.
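As one common way to make such a constraint concrete (the paper's precise norm should be consulted), a depth-$L$ ReLU network $f_\theta(x)=W_L\,\sigma(W_{L-1}\cdots\sigma(W_1x+b_1)\cdots+b_{L-1})$ may be restricted to a class of the form
\[
\mathcal{F}_K=\Big\{f_\theta:\ \prod_{l=1}^{L}\|W_l\|\le K\Big\},
\]
in which the product of the layer weight norms, rather than the network width, bounds the Lipschitz constant of $f_\theta$ and, heuristically, its Rademacher complexity; this is consistent with the convergence rates for over-parameterized networks noted above.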
Just Least Squares: Binary Compressive Sampling with Low Generative Intrinsic Dimension
Jiao, Yuling, Li, Dingwei, Liu, Min, Lu, Xiliang, Yang, Yuanyuan
In this paper, we consider recovering $n$-dimensional signals from $m$ binary measurements corrupted by noise and sign flips, under the assumption that the target signals have low generative intrinsic dimension, i.e., the target signals can be approximately generated via an $L$-Lipschitz generator $G: \mathbb{R}^k\rightarrow\mathbb{R}^{n}$, $k\ll n$. Although the binary measurement model is highly nonlinear, we propose a least squares decoder and prove that, up to a constant $c$, with high probability, it achieves a sharp estimation error $\mathcal{O} (\sqrt{\frac{k\log (Ln)}{m}})$ as long as $m\geq \mathcal{O}( k\log (Ln))$. Extensive numerical simulations and comparisons with state-of-the-art methods demonstrate that the least squares decoder is robust to noise and sign flips, as indicated by our theory. By constructing a ReLU network with properly chosen depth and width, we verify that the (approximate) deep generative prior can be satisfied, which is of independent interest.
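In schematic form (the exact noise model and normalization follow the paper), the measurements and the least squares decoder can be written as
\[
y_i=\eta_i\,\mathrm{sign}\big(\langle a_i,x^{\ast}\rangle+\epsilon_i\big),\qquad \hat x=G(\hat z),\quad \hat z\in\arg\min_{z\in\mathbb{R}^k}\ \|y-AG(z)\|_2^2,
\]
where $\eta_i\in\{\pm 1\}$ models sign flips, $\epsilon_i$ is noise, and the target signal $x^{\ast}$ lies (approximately) in the range of the generator $G$.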
A Data-Driven Line Search Rule for Support Recovery in High-dimensional Data Analysis
Li, Peili, Jiao, Yuling, Lu, Xiliang, Kang, Lican
In this work, we consider algorithms for (nonlinear) regression problems with an $\ell_0$ penalty. Existing algorithms for $\ell_0$-based optimization problems are often carried out with a fixed step size, and the selection of an appropriate step size depends on the restricted strong convexity and smoothness of the loss function, which are difficult to compute in practice. Inspired by the ideas of support detection and root finding \cite{HJK2020}, we propose a novel and efficient data-driven line search rule to adaptively determine the appropriate step size. We prove an $\ell_2$ error bound for the proposed algorithm without strong restrictions on the cost functional. A large number of numerical comparisons with state-of-the-art algorithms on linear and logistic regression problems show the stability, effectiveness and superiority of the proposed algorithms.
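Schematically, the problem and the role of the step size can be summarized as (the actual support-detection and root-finding updates follow \cite{HJK2020} and the paper)
\[
\min_{\beta\in\mathbb{R}^p}\ \mathcal{L}(\beta)+\lambda\|\beta\|_0,\qquad \beta^{k+1}=H\big(\beta^{k}-t_k\nabla\mathcal{L}(\beta^{k})\big),
\]
where $H$ denotes a hard-thresholding/support-detection operator and $t_k$ is the step size, which the proposed data-driven line search rule selects adaptively instead of fixing it via restricted strong convexity and smoothness constants.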
Non-Asymptotic Error Bounds for Bidirectional GANs
Liu, Shiao, Yang, Yunfei, Huang, Jian, Jiao, Yuling, Wang, Yang
We derive nearly sharp bounds for the bidirectional GAN (BiGAN) estimation error under the Dudley distance between the latent joint distribution and the data joint distribution, with appropriately specified architectures of the neural networks used in the model. To the best of our knowledge, this is the first theoretical guarantee for the bidirectional GAN learning approach. An appealing feature of our results is that they do not require the reference and the data distributions to have the same dimension or to have bounded support. These conditions are commonly assumed in the existing convergence analyses of unidirectional GANs but may not be satisfied in practice. Our results are also applicable to the Wasserstein bidirectional GAN if the target distribution is assumed to have bounded support. To prove these results, we construct neural network functions that push forward an empirical distribution to another arbitrary empirical distribution on a possibly different-dimensional space. We also develop a novel decomposition of the integral probability metric for the error analysis of bidirectional GANs. These basic theoretical results are of independent interest and can be applied to other related learning problems.
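Recall that, up to the exact normalization of the bounded-Lipschitz norm, the Dudley distance referred to above can be written as
\[
d_{\mathrm{BL}}(\mu,\nu)=\sup\Big\{\Big|\int f\,d\mu-\int f\,d\nu\Big|:\ \|f\|_{\infty}+\mathrm{Lip}(f)\le 1\Big\},
\]
which is well defined without moment or bounded-support assumptions and metrizes weak convergence; this is consistent with the bounded-support assumption only being needed for the Wasserstein bidirectional GAN result above.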
Relative Entropy Gradient Sampler for Unnormalized Distributions
Feng, Xingdong, Gao, Yuan, Huang, Jian, Jiao, Yuling, Liu, Xu
We propose a relative entropy gradient sampler (REGS) for sampling from unnormalized distributions. REGS is a particle method that seeks a sequence of simple nonlinear transforms that iteratively push the initial samples from a reference distribution toward samples from an unnormalized target distribution. To determine the nonlinear transform at each iteration, we consider the Wasserstein gradient flow of the relative entropy. This gradient flow determines a path of probability distributions that interpolates between the reference distribution and the target distribution. It is characterized by an ODE system whose velocity fields depend on the ratio between the density of the evolving particles and the unnormalized target density. To sample with REGS, we need to estimate the density ratios and simulate the ODE system with particle evolution. We propose a novel nonparametric approach to estimating the logarithmic density ratio using neural networks. Extensive simulation studies on challenging multimodal 1D and 2D mixture distributions and Bayesian logistic regression on real datasets demonstrate that REGS outperforms the state-of-the-art sampling methods included in the comparison.
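In schematic form, the Wasserstein gradient flow of the relative entropy $\mathrm{KL}(\rho\,\|\,\pi)$ mentioned above transports particles according to
\[
\frac{dX_t}{dt}=-\nabla\log\frac{\rho_t(X_t)}{\pi(X_t)},
\]
where $\rho_t$ is the density of the evolving particles and $\pi$ the (unnormalized) target. Since only the gradient of the logarithmic density ratio enters, the normalizing constant of $\pi$ cancels; REGS estimates this log density ratio with a neural network and then advances the particles by a discretized step of the ODE.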
Coordinate Descent for MCP/SCAD Penalized Least Squares Converges Linearly
Jiao, Yuling, Li, Dingwei, Liu, Min, Lu, Xiliang
Recovering sparse signals from observed data is an important topic in signal/image processing, statistics and machine learning. Nonconvex penalized least squares methods have attracted much attention since they enjoy nice statistical properties. Computationally, coordinate descent (CD) is a workhorse for minimizing the nonconvex penalized least squares criterion due to its simplicity and scalability. In this work, we prove a linear convergence rate for CD applied to MCP/SCAD penalized least squares problems.
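For reference (assuming standardized columns with $\|x_j\|_2^2/n=1$ and MCP parameter $\gamma>1$; the paper's normalization may differ), the MCP penalty and the resulting coordinate-wise update take the familiar closed forms
\[
p_{\lambda,\gamma}(t)=\begin{cases}\lambda t-\dfrac{t^{2}}{2\gamma}, & 0\le t\le\gamma\lambda,\\[4pt] \dfrac{\gamma\lambda^{2}}{2}, & t>\gamma\lambda,\end{cases}
\qquad
\hat\beta_j=\begin{cases}\dfrac{S(z_j,\lambda)}{1-1/\gamma}, & |z_j|\le\gamma\lambda,\\[4pt] z_j, & |z_j|>\gamma\lambda,\end{cases}
\]
where $z_j$ is the univariate least squares coefficient computed from the $j$-th partial residual and $S(z,\lambda)=\mathrm{sign}(z)\max(|z|-\lambda,0)$ is the soft-thresholding operator; SCAD admits an analogous thresholding formula.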
An error analysis of generative adversarial networks for learning distributions
Huang, Jian, Jiao, Yuling, Li, Zhen, Liu, Shiao, Wang, Yang, Yang, Yunfei
This paper studies how well generative adversarial networks (GANs) learn probability distributions from finite samples. Our main results establish the convergence rates of GANs under a collection of integral probability metrics defined through H\"older classes, including the Wasserstein distance as a special case. We also show that GANs are able to adaptively learn data distributions with low-dimensional structures or with H\"older densities, when the network architectures are chosen properly. In particular, for distributions concentrated around a low-dimensional set, we show that the learning rates of GANs do not depend on the high ambient dimension, but rather on the lower intrinsic dimension. Our analysis is based on a new oracle inequality that decomposes the estimation error into the generator and discriminator approximation errors and the statistical error, which may be of independent interest.
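The integral probability metrics in question take the form
\[
d_{\mathcal{F}}(\mu,\nu)=\sup_{f\in\mathcal{F}}\Big(\mathbb{E}_{X\sim\mu}\,f(X)-\mathbb{E}_{Y\sim\nu}\,f(Y)\Big),
\]
where $\mathcal{F}$ is a H\"older ball; taking $\mathcal{F}$ to be the class of 1-Lipschitz functions recovers the Wasserstein-1 distance as the special case noted above.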