Hyperplane bounds for neural feature mappings

Yepes, Antonio Jimeno

arXiv.org Artificial Intelligence 

When minimising the empirical risk, the generalisation of the learnt function depends on its performance on the training data, the Vapnik-Chervonenkis (VC) dimension of the function class, and the number of training examples. Neural networks have a large number of parameters, which correlates with a VC dimension that is typically large but not infinite, and effectively training them typically requires a large number of training instances. In this work, we explore how to optimise feature mappings with a neural network so as to reduce the effective VC dimension of the hyperplane found in the space generated by the mapping. An interpretation of the results of this study is that it is possible to define a loss that controls the VC dimension of the separating hyperplane. We evaluate this approach and observe that its performance improves when the size of the training set is small.
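To make the stated dependence explicit, the bound alluded to is presumably a standard VC generalisation bound; one common form (from Vapnik's Statistical Learning Theory, an assumption about which specific bound the paper uses) states that with probability at least 1 - \delta over a draw of n training examples, for a function class of VC dimension h, with R and R_emp the true and empirical risks:

```latex
R(f) \;\le\; R_{\mathrm{emp}}(f)
  \;+\; \sqrt{\frac{h\left(\ln\frac{2n}{h} + 1\right) - \ln\frac{\delta}{4}}{n}}
```

The second term grows with h and shrinks with n, which is why controlling the effective VC dimension matters most when the training set is small.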
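As a minimal sketch of how such a loss could control the VC dimension of the separating hyperplane, the snippet below combines a hinge loss on neurally mapped features with a penalty motivated by the margin-based bound h <= R^2 ||w||^2 + 1 (data inside a ball of radius R, canonical hyperplane with margin 1/||w||). This is an illustrative construction under those assumptions, not the paper's actual method; all names (FeatureMapNet, vc_control_loss, lam) are hypothetical:

```python
import torch
import torch.nn as nn

class FeatureMapNet(nn.Module):
    """Neural feature mapping phi followed by a linear hyperplane w.phi(x) + b."""
    def __init__(self, in_dim: int, feat_dim: int):
        super().__init__()
        self.phi = nn.Sequential(
            nn.Linear(in_dim, 128),
            nn.ReLU(),
            nn.Linear(128, feat_dim),
        )
        self.hyperplane = nn.Linear(feat_dim, 1)  # separating hyperplane in feature space

    def forward(self, x):
        z = self.phi(x)                           # mapped features phi(x)
        return self.hyperplane(z).squeeze(-1), z

def vc_control_loss(model, x, y, lam=1e-2):
    """Hinge loss plus a penalty tied to the margin-based VC bound (illustrative).

    For a canonical hyperplane the margin is 1/||w||, and the VC dimension of
    margin hyperplanes on data in a ball of radius R is bounded by roughly
    R^2 * ||w||^2 + 1, so penalising R^2 * ||w||^2 shrinks that bound.
    """
    scores, z = model(x)                             # labels y in {-1, +1}
    hinge = torch.clamp(1.0 - y * scores, min=0.0).mean()
    w_norm2 = model.hyperplane.weight.pow(2).sum()   # ||w||^2 = 1 / margin^2
    radius2 = z.pow(2).sum(dim=1).max()              # empirical R^2 of the mapped batch
    return hinge + lam * radius2 * w_norm2
```

Minimising this jointly shapes the mapping and the hyperplane so that the data is separated with a large margin inside a small feature-space ball, which is one way a loss can keep the effective VC dimension of the hyperplane low.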