Shao, Yuan-Hai
Learning using granularity statistical invariants for classification
Zhu, Ting-Ting, Shao, Yuan-Hai, Li, Chun-Na, Liu, Tian
Learning using statistical invariants (LUSI) is a recent learning paradigm that adopts a weak convergence mechanism and can be applied to a wide range of classification problems. However, the computational cost of the invariant matrices in LUSI is high for large-scale datasets during training. To address this issue, this paper introduces a granularity statistical invariant for LUSI and develops a new learning paradigm called learning using granularity statistical invariants (LUGSI). LUGSI employs both strong and weak convergence mechanisms, taking the perspective of minimizing the expected risk. To the best of our knowledge, this is the first construction of granularity statistical invariants. Compared to LUSI, the new statistical invariant brings two advantages. First, it enhances the structural information of the data. Second, LUGSI transforms a large invariant matrix into a much smaller one by maximizing the distance between classes, making classification of large-scale datasets feasible and significantly increasing training speed. Experimental results indicate that LUGSI not only exhibits improved generalization but also trains faster, particularly on large-scale datasets.
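A minimal sketch of the computational point only, under assumed forms: a kernel least-squares fit whose data term is weighted by an invariant matrix V stands in for the invariant-based estimate, and the "granular" variant factors V through a handful of group-level statistics (here simply class indicators), so V never has to be applied as a full n x n matrix. The names and weighting choices are hypothetical; this is not the LUGSI formulation itself.

```python
import numpy as np

def fit_kernel_ls(K, y, gamma, V_apply):
    """Regularized kernel least squares whose data term is weighted by an invariant
    matrix V, accessed only through V_apply (so V need not be formed explicitly):
        minimize  (K a - y)^T V (K a - y) + gamma * a^T K a
    """
    n = K.shape[0]
    A = K.T @ V_apply(K) + gamma * K + 1e-8 * np.eye(n)
    return np.linalg.solve(A, K.T @ V_apply(y))

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = np.where(X[:, 0] + 0.3 * rng.normal(size=300) > 0, 1.0, -1.0)
sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
K = np.exp(-0.5 * sq)

# Sample-level invariant: V is a full n x n matrix (here simply the identity).
V_full = np.eye(len(y))
a_full = fit_kernel_ls(K, y, 1e-2, lambda M: V_full @ M)

# Granular invariant: samples are first aggregated into g groups (here: the two classes),
# so V factors as G^T W G with W only g x g; applying it costs O(g n) per column.
G = np.stack([(y == c).astype(float) / max((y == c).sum(), 1) for c in np.unique(y)])
W = np.eye(G.shape[0])
a_gran = fit_kernel_ls(K, y, 1e-2, lambda M: G.T @ (W @ (G @ M)))

for a, name in [(a_full, "sample-level"), (a_gran, "granular")]:
    print(name, "train acc:", (np.sign(K @ a) == y).mean())
```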
Least Squares Maximum and Weighted Generalization-Memorization Machines
Wang, Shuai, Wang, Zhen, Shao, Yuan-Hai
In this paper, we propose a new way of remembering by introducing a memory influence mechanism for the least squares support vector machine (LSSVM). Without changing the equality constraints of the original LSSVM, this mechanism allows an accurate partitioning of the training set without overfitting. The maximum memory impact model (MIMM) and the weighted impact memory model (WIMM) are then proposed. It is shown that these models degenerate into the LSSVM. Furthermore, we propose several different memory impact functions for the MIMM and WIMM. The experimental results show that our MIMM and WIMM have better generalization performance than the LSSVM and a significant advantage in time cost compared to other memory models.
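For reference, a compact sketch of the plain LSSVM classifier that both MIMM and WIMM start from; the memory influence terms themselves are not reproduced here, and the Gaussian kernel width and regularization value are illustrative choices.

```python
import numpy as np

def lssvm_fit(X, y, gamma=10.0, sigma=1.0):
    """Solve the standard LSSVM classifier dual linear system:
        [ 0      y^T          ] [b]   [0]
        [ y  Omega + I/gamma  ] [a] = [1],   Omega_ij = y_i y_j k(x_i, x_j).
    """
    n = len(y)
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    K = np.exp(-sq / (2 * sigma ** 2))
    Omega = (y[:, None] * y[None, :]) * K
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = y
    A[1:, 0] = y
    A[1:, 1:] = Omega + np.eye(n) / gamma
    rhs = np.concatenate(([0.0], np.ones(n)))
    sol = np.linalg.solve(A, rhs)
    return sol[1:], sol[0], X, y, sigma          # alpha, b, training data, kernel width

def lssvm_predict(model, Xt):
    alpha, b, X, y, sigma = model
    sq = ((Xt[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    K = np.exp(-sq / (2 * sigma ** 2))
    return np.sign(K @ (alpha * y) + b)

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 2))
y = np.where(X[:, 0] > 0, 1.0, -1.0)
model = lssvm_fit(X, y)
print("train acc:", (lssvm_predict(model, X) == y).mean())
```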
Nonlinear Kernel Support Vector Machine with 0-1 Soft Margin Loss
Liu, Ju, Huang, Ling-Wei, Shao, Yuan-Hai, Chen, Wei-Jie, Li, Chun-Na
Recent advances on the linear support vector machine with the 0-1 soft margin loss ($L_{0/1}$-SVM) show that the 0-1 loss problem can be solved directly. However, its theoretical and algorithmic requirements prevent us from extending the linear solving framework directly to its nonlinear kernel form; one major deficiency is the absence of an explicit expression for the Lagrangian dual function of $L_{0/1}$-SVM. In this paper, by applying the nonparametric representation theorem, we propose a nonlinear model for the support vector machine with the 0-1 soft margin loss, called $L_{0/1}$-KSVM, which incorporates the kernel technique and, more importantly, follows the success in systematically solving the linear task. Its optimality conditions are explored theoretically, and a working set selection alternating direction method of multipliers (ADMM) algorithm is introduced to obtain a numerical solution. Moreover, we present, for the first time, a closed-form definition of the support vectors (SVs) of $L_{0/1}$-KSVM. Theoretically, we prove that all SVs of $L_{0/1}$-KSVM lie only on the parallel decision surfaces. The experiments also show that $L_{0/1}$-KSVM has far fewer SVs, together with decent prediction accuracy, compared with its linear peer $L_{0/1}$-SVM and six other nonlinear benchmark SVM classifiers.
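A small sketch of the objective only (not of the working-set ADMM solver): the kernel expansion assumed here follows the representer form f(x) = sum_j beta_j K(x, x_j) + b, and the 0-1 soft margin loss charges a flat unit cost for every margin violation.

```python
import numpy as np

def l01_ksvm_objective(beta, b, K, y, C):
    """0.5 * beta^T K beta + C * #{i : 1 - y_i f(x_i) > 0},  with f = K beta + b."""
    margins = y * (K @ beta + b)
    violations = np.count_nonzero(1.0 - margins > 0.0)   # the 0-1 soft margin loss
    return 0.5 * beta @ K @ beta + C * violations

# Tiny usage example with a hypothetical 2 x 2 kernel matrix.
K = np.array([[1.0, 0.5], [0.5, 1.0]])
print(l01_ksvm_objective(np.array([0.3, -0.3]), 0.0, K, np.array([1.0, -1.0]), C=1.0))
```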
Generalization-Memorization Machines
Wang, Zhen, Shao, Yuan-Hai
Classifying the training data correctly without overfitting is one of the goals in machine learning. In this paper, we propose a generalization-memorization mechanism consisting of a generalization-memorization decision and a memory modeling principle. Under this mechanism, error-based learning machines improve their memorization of the training data without overfitting. Specifically, the generalization-memorization machines (GMM) are proposed by applying this mechanism. The optimization problems in GMM are quadratic programming problems and can be solved efficiently. Notably, the recently proposed generalization-memorization kernel and the corresponding support vector machines are special cases of our GMM. Experimental results show the effectiveness of the proposed GMM in both memorization and generalization.
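The abstract does not spell out the decision rule, so the following is only a generic, hedged illustration of a memorization term: a base linear score is corrected by a narrow local kernel fitted to the training residuals, which reproduces the training labels while barely changing predictions far from the data. It is not the GMM optimization problem from the paper.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = np.where(X[:, 0] + 0.2 * rng.normal(size=100) > 0, 1.0, -1.0)

# Base (possibly imperfect) learner: ordinary least squares on [X, 1].
w = np.linalg.lstsq(np.c_[X, np.ones(100)], y, rcond=None)[0]
base = np.c_[X, np.ones(100)] @ w

# Narrow "memory" kernel fitted to the residuals, so training labels are reproduced.
sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
M = np.exp(-sq / (2 * 0.05 ** 2))
delta = np.linalg.solve(M + 1e-6 * np.eye(100), y - base)

memorized = base + M @ delta
print("base train acc:", (np.sign(base) == y).mean(),
      "| with memory term:", (np.sign(memorized) == y).mean())
```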
Two-dimensional Bhattacharyya bound linear discriminant analysis with its applications
Guo, Yan-Ru, Bai, Yan-Qin, Li, Chun-Na, Bai, Lan, Shao, Yuan-Hai
The recently proposed L2-norm linear discriminant analysis criterion based on Bhattacharyya error bound estimation (L2BLDA) is an effective improvement over linear discriminant analysis (LDA) for feature extraction. However, L2BLDA was proposed only for vector input samples. When faced with two-dimensional (2D) inputs such as images, it loses useful information, since it does not consider the intrinsic structure of images. In this paper, we extend L2BLDA to a two-dimensional Bhattacharyya bound linear discriminant analysis (2DBLDA). 2DBLDA maximizes the matrix-based between-class distance, measured by the weighted pairwise distances of class means, and at the same time minimizes the matrix-based within-class distance. The weighting constant between the between-class and within-class terms is determined by the data involved, which makes the proposed 2DBLDA adaptive. In addition, the criterion of 2DBLDA is equivalent to optimizing an upper bound of the Bhattacharyya error. The construction of 2DBLDA avoids the small sample size problem, is robust, and can be solved through a simple standard eigenvalue decomposition. Experimental results on image recognition and face image reconstruction demonstrate the effectiveness of the proposed method.
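A hedged sketch of a 2D, difference-based discriminant of this flavour: a right-projection W maximizes tr(W^T (S_b - delta * S_w) W) over orthonormal columns and is obtained from one standard eigenvalue decomposition. The between-class scatter below uses class means against the overall mean and delta is set by a simple trace ratio, which simplifies the paper's weighted pairwise form and its data-driven constant.

```python
import numpy as np

def two_d_discriminant(images, labels, r, delta=None):
    """images: array (n, h, w); returns W (w x r) so that A @ W is the reduced feature."""
    classes = np.unique(labels)
    overall = images.mean(axis=0)
    h, w = overall.shape
    Sw = np.zeros((w, w))
    Sb = np.zeros((w, w))
    for c in classes:
        Ac = images[labels == c]
        Mc = Ac.mean(axis=0)
        D = Ac - Mc
        Sw += np.einsum('nij,nik->jk', D, D)                # sum_i (A_i - M_c)^T (A_i - M_c)
        Sb += len(Ac) * (Mc - overall).T @ (Mc - overall)   # class mean vs. overall mean
    if delta is None:
        delta = np.trace(Sb) / max(np.trace(Sw), 1e-12)     # illustrative choice, not the paper's
    evals, evecs = np.linalg.eigh(Sb - delta * Sw)
    return evecs[:, np.argsort(evals)[::-1][:r]]
```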
Capped norm linear discriminant analysis and its applications
Liu, Jiakou, Xiong, Xiong, Ren, Pei-Wei, Zhao, Da, Li, Chun-Na, Shao, Yuan-Hai
Classical linear discriminant analysis (LDA) is based on the squared Frobenius norm and hence is sensitive to outliers and noise. To improve the robustness of LDA, in this paper we introduce the capped $\ell_{2,1}$-norm of a matrix, which employs the non-squared $\ell_2$-norm and a "capped" operation, and further propose a novel capped $\ell_{2,1}$-norm linear discriminant analysis, called CLDA. Due to the use of the capped $\ell_{2,1}$-norm, CLDA can effectively remove extreme outliers and suppress the effect of noisy data. In fact, CLDA can also be viewed as a weighted LDA. CLDA is solved through a series of generalized eigenvalue problems with guaranteed convergence. Experimental results on an artificial data set, several UCI data sets, and two image data sets demonstrate the effectiveness of CLDA.
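A generic, hedged sketch of the "capped norm as weighted LDA" idea: samples whose projected within-class deviation exceeds a cap epsilon receive weight zero, the rest are reweighted by their inverse deviation, and a weighted generalized eigenvalue problem is re-solved. This is an illustration, not the exact CLDA objective or its convergence analysis.

```python
import numpy as np
from scipy.linalg import eigh

def capped_lda(X, y, r, eps=3.0, iters=10):
    n, d = X.shape
    classes = np.unique(y)
    W = np.linalg.qr(np.random.default_rng(0).normal(size=(d, r)))[0]
    for _ in range(iters):
        mu = {c: X[y == c].mean(axis=0) for c in classes}
        # Projected within-class deviation of each sample under the current projection.
        dev = np.array([np.linalg.norm(W.T @ (x - mu[c])) for x, c in zip(X, y)])
        # Capping: outliers (deviation beyond eps) get weight 0, others inverse weights.
        wgt = np.where(dev <= eps, 1.0 / (2.0 * np.maximum(dev, 1e-8)), 0.0)
        Sw = sum(wgt[i] * np.outer(X[i] - mu[y[i]], X[i] - mu[y[i]]) for i in range(n))
        m = X.mean(axis=0)
        Sb = sum((y == c).sum() * np.outer(mu[c] - m, mu[c] - m) for c in classes)
        vals, vecs = eigh(Sb, Sw + 1e-6 * np.eye(d))   # weighted generalized eigenproblem
        W = vecs[:, np.argsort(vals)[::-1][:r]]
    return W
```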
Principal Component Analysis Based on T$\ell_1$-norm Maximization
Yang, Xiang-Fei, Shao, Yuan-Hai, Li, Chun-Na, Liu, Li-Ming, Deng, Nai-Yang
Classical principal component analysis (PCA) may suffer from sensitivity to outliers and noise. Therefore, PCA variants based on the $\ell_1$-norm and the $\ell_p$-norm ($0 < p < 1$) have been studied. Among them, the ones based on the $\ell_p$-norm appear to be the most interesting from the robustness point of view. However, their numerical performance is not satisfactory. Note that, although the T$\ell_1$-norm is similar to the $\ell_p$-norm ($0 < p < 1$) in some sense, it has a stronger suppression effect on outliers and better continuity. We therefore propose PCA based on the T$\ell_1$-norm in this paper. Our numerical experiments show that its performance is clearly superior to that of PCA-$\ell_p$, $\ell_p$SPCA, PCA, and PCA-$\ell_1$.
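The T$\ell_1$-norm is not defined in the abstract; the sketch below assumes the common transformed-$\ell_1$ form $\rho_a(t) = (a+1)|t|/(a+|t|)$, which is bounded and therefore suppresses outliers, and uses a plain normalized-gradient fixed point for one principal direction. The update is an illustrative choice, not necessarily the algorithm used in the paper.

```python
import numpy as np

def tl1(t, a=1.0):
    """Transformed-ell_1 value rho_a(t) = (a+1)|t| / (a + |t|), bounded by a+1."""
    return (a + 1.0) * np.abs(t) / (a + np.abs(t))

def pca_tl1_direction(X, a=1.0, iters=100, seed=0):
    """Seek w with ||w|| = 1 that (approximately) maximizes sum_i rho_a(w^T x_i)."""
    Xc = X - X.mean(axis=0)
    w = np.random.default_rng(seed).normal(size=X.shape[1])
    w /= np.linalg.norm(w)
    for _ in range(iters):
        s = Xc @ w
        g = (a + 1.0) * a * np.sign(s) / (a + np.abs(s)) ** 2   # d rho_a / d s
        w_new = Xc.T @ g
        if np.linalg.norm(w_new) < 1e-12:
            break
        w = w_new / np.linalg.norm(w_new)
    return w, tl1(Xc @ w, a).sum()
```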
Single Versus Union: Non-parallel Support Vector Machine Frameworks
Li, Chun-Na, Shao, Yuan-Hai, Wang, Huajun, Zhao, Yu-Ting, Huang, Ling-Wei, Xiu, Naihua, Deng, Nai-Yang
Considering the classification problem, we summarize nonparallel support vector machines, which use nonparallel hyperplanes, into two types of frameworks. The first type solves a series of small optimization problems to obtain the hyperplanes one by one, but it is hard to measure the loss of each sample. The other type constructs all the hyperplanes simultaneously by solving one large optimization problem with the loss of each sample ascertained. We give the characteristics of each framework and compare them carefully. In addition, based on the second framework, we construct a max-min distance-based nonparallel support vector machine for the multiclass classification problem, called NSVM. Experimental results on benchmark data sets and human face databases show the advantages of our NSVM. For the binary classification problem, the generalized eigenvalue proximal support vector machine (GEPSVM) was proposed by Mangasarian and Wild [1] in 2006 as the first nonparallel support vector machine. It aims at generating two nonparallel hyperplanes such that each hyperplane is closer to its own class and as far as possible from the other class. GEPSVM is effective, particularly when dealing with "Xor"-type data [1], and this has led to extensive studies on nonparallel support vector machines (NSVMs) [2]-[5].
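Since GEPSVM is cited above as the prototypical nonparallel SVM of the first ("single") type, here is a hedged sketch of its pair of generalized eigenvalue problems; the ridge terms and the nearest-plane assignment are standard choices, and this is not the NSVM proposed in the paper.

```python
import numpy as np
from scipy.linalg import eigh

def gepsvm_plane(A, B, delta=1e-3):
    """Return (w, b) of the plane close to the rows of A and far from the rows of B:
    minimize (||[A e] z||^2 + delta ||z||^2) / ||[B e] z||^2 over z = [w; b]."""
    Ae = np.c_[A, np.ones(len(A))]
    Be = np.c_[B, np.ones(len(B))]
    G = Ae.T @ Ae + delta * np.eye(Ae.shape[1])
    H = Be.T @ Be + delta * np.eye(Be.shape[1])   # small ridge keeps H positive definite
    vals, vecs = eigh(G, H)                       # generalized eigenproblem G z = lambda H z
    z = vecs[:, np.argmin(vals)]                  # eigenvector of the smallest eigenvalue
    return z[:-1], z[-1]

def gepsvm_predict(x, plane_pos, plane_neg):
    """Assign x to the class whose (nonparallel) plane is nearer."""
    d = [abs(w @ x + b) / np.linalg.norm(w) for (w, b) in (plane_pos, plane_neg)]
    return 1 if d[0] <= d[1] else -1
```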
A general model for plane-based clustering with loss function
Wang, Zhen, Shao, Yuan-Hai, Bai, Lan, Li, Chun-Na, Liu, Li-Ming
In this paper, we propose a general model for plane-based clustering. The general model contains many existing plane-based clustering methods, e.g., k-plane clustering (kPC), proximal plane clustering (PPC), twin support vector clustering (TWSVC), and their extensions. Under this general model, one may derive an appropriate clustering method for a specific purpose. The general model is a procedure corresponding to an optimization problem that minimizes the total loss of the samples, where the loss of a sample comes from both within-cluster and between-cluster terms. In theory, the termination conditions are discussed, and we prove that the general model terminates in a finite number of steps at a local or weak local optimal point. Furthermore, based on this general model, we propose a plane-based clustering method by introducing a new loss function to capture the data distribution precisely. Experimental results on artificial and publicly available datasets verify the effectiveness of the proposed method.
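As a concrete, hedged instance of the alternating procedure the general model describes, the sketch below implements the simplest method it cites, k-plane clustering (kPC): assign each sample to its nearest plane, then refit each plane from its assigned samples via a smallest-eigenvector step. The loss here is the plain squared distance, not the new loss proposed in the paper.

```python
import numpy as np

def kpc(X, k, iters=20, seed=0):
    n, d = X.shape
    rng = np.random.default_rng(seed)
    labels = rng.integers(k, size=n)
    planes = [(np.eye(d)[0], 0.0)] * k
    for _ in range(iters):
        # Plane update: for each cluster, the plane w^T x + b = 0 through the cluster mean
        # whose normal is the direction of smallest within-cluster spread.
        for j in range(k):
            Xj = X[labels == j]
            if len(Xj) == 0:
                continue
            mu = Xj.mean(axis=0)
            S = (Xj - mu).T @ (Xj - mu)
            vals, vecs = np.linalg.eigh(S)
            w = vecs[:, 0]
            planes[j] = (w, -w @ mu)
        # Assignment update: each sample goes to the plane it is closest to.
        dists = np.stack([np.abs(X @ w + b) for (w, b) in planes], axis=1)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels, planes
```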
Ramp-based Twin Support Vector Clustering
Wang, Zhen, Chen, Xu, Li, Chun-Na, Shao, Yuan-Hai
Traditional plane-based clustering methods measure the within-cluster and between-cluster costs by quadratic, linear, or other unbounded functions, which may amplify the influence of large costs. This letter introduces a ramp cost function into plane-based clustering and proposes a new clustering method, called ramp-based twin support vector clustering (RampTWSVC). RampTWSVC is more robust because its cost is bounded, and thus it finds the intrinsic clusters more easily than other plane-based clustering methods. The non-convex programming problem in RampTWSVC is solved efficiently through an alternating iteration algorithm, and its local solution can be obtained in a finite number of iterations theoretically. In addition, a nonlinear manifold-based formulation of RampTWSVC is proposed via the kernel trick. Experimental results on several benchmark datasets show the better performance of RampTWSVC compared with other plane-based clustering methods.
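A small illustration of the boundedness argument: a ramp cost saturates while hinge and quadratic costs grow without bound, so far-away points have a capped influence. The exact within-cluster and between-cluster ramp functions of RampTWSVC are not reproduced here; this only shows the clipping behaviour.

```python
import numpy as np

def hinge(u):
    return np.maximum(0.0, u)

def quadratic(u):
    return u ** 2

def ramp(u, s=1.0):
    return np.minimum(np.maximum(0.0, u), s)   # clipped hinge, bounded by s

u = np.array([0.2, 1.0, 5.0, 50.0])            # e.g. distances of points to a cluster plane
print("hinge:    ", hinge(u))                   # grows without bound -> outliers dominate
print("quadratic:", quadratic(u))               # grows even faster
print("ramp:     ", ramp(u))                    # saturates at 1 -> outliers have capped influence
```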