Complex-valued Neurons Can Learn More but Slower than Real-valued Neurons via Gradient Descent

Jan-17-2025, 12:52:18 GMT–Neural Information Processing Systems

Complex-valued neural networks potentially possess better representations and performance than real-valued counterparts when dealing with some complicated tasks such as acoustic analysis, radar image classification, etc. Despite empirical successes, it remains unknown theoretically when and to what extent complex-valued neural networks outperform real-valued ones. We take one step in this direction by comparing the learnability of real-valued neurons and complex-valued neurons via gradient descent. We show that a complex-valued neuron can efficiently learn functions expressed by any one real-valued neuron and any one complex-valued neuron with convergence rate O(t {-3}) and O(t {-1}) where t is the iteration index of gradient descent, respectively, whereas a two-layer real-valued neural network with finite width cannot learn a single non-degenerate complex-valued neuron. We prove that a complex-valued neuron learns a real-valued neuron with rate \Omega (t {-3}), exponentially slower than the O(\mathrm{e} {- c t}) rate of learning one real-valued neuron using a real-valued neuron with a constant c .

complex-valued neuron, neuron, real-valued neuron, (1 more...)

Neural Information Processing Systems

Jan-17-2025, 12:52:18 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (0.89)
  - Neural Networks (0.78)