AITopics | different frequency

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Neural Information Processing SystemsDec-25-2025, 10:33:27 GMT

The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

We study the relationship between the frequency of a function and the speed at which a neural network learns it. We build on recent results that show that the dynamics of overparameterized neural networks trained with gradient descent can be well approximated by a linear system. When normalized training data is uniformly distributed on a hypersphere, the eigenfunctions of this linear system are spherical harmonic functions. We derive the corresponding eigenvalues for each frequency after introducing a bias term in the model. This bias term had been omitted from the linear network model without significantly affecting previous theoretical results. However, we show theoretically and experimentally that a shallow neural network without bias cannot represent or learn simple, low frequency functions with odd frequencies. Our results lead to specific predictions of the time it will take a network to learn functions of varying frequency. These predictions match the empirical behavior of both shallow and deep networks.

convergence rate, learned function, neural network, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.78)

Neural Information Processing SystemsOct-8-2025, 11:17:38 GMT

382a8606a85ca6ec7c06185a1a95ce8b-Supplemental-Conference.pdf

artificial intelligence, machine learning, neural lad, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.33)

arXiv.org Artificial IntelligenceApr-29-2025

Dual-channel Heterophilic Message Passing for Graph Fraud Detection

Zhang, Wenxin, Zhong, Jingxing, Yao, Guangzhen, Han, Renda, Lin, Xiaojian, Zhang, Zeyu, Luo, Cuicui

--Fraudulent activities have significantly increased across various domains, such as e-commerce, online review platforms, and social networks, making fraud detection a critical task. Spatial Graph Neural Networks (GNNs) have been successfully applied to fraud detection tasks due to their strong inductive learning capabilities. However, existing spatial GNN-based methods often enhance the graph structure by excluding heterophilic neighbors during message passing to align with the homophilic bias of GNNs. Unfortunately, this approach can disrupt the original graph topology and increase uncertainty in predictions. T o address these limitations, this paper proposes a novel framework, Dual-channel Heterophilic Message Passing (DHMP), for fraud detection. DHMP leverages a heterophily separation module to divide the graph into homophilic and heterophilic subgraphs, mitigating the low-pass inductive bias of traditional GNNs. It then applies shared weights to capture signals at different frequencies independently and incorporates a customized sampling strategy for training. This allows nodes to adaptively balance the contributions of various signals based on their labels. Extensive experiments on three real-world datasets demonstrate that DHMP outperforms existing methods, highlighting the importance of separating signals with different frequencies for improved fraud detection. The code is available at https://github.com/shaieesss/DHMP.

artificial intelligence, dhmp, machine learning, (15 more...)

2504.14205

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > China > Fujian Province > Fuzhou (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Information Management (0.93)

Neural Information Processing SystemsJan-23-2025, 23:49:54 GMT

Reviews: The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

What functions do NNs learn (approximate a function) and how fast are central questions in the study of the dynamics of (D)NNs. A common conception behind this problem is that if one trains a network longer than necessary, then the model might overfit. However, the definition of overfitting appears to vary from paper to paper. Moreover, overfitting is intimately linked with another hot topic in the area: over-parametrization. Please refer to "Advani & Saxe 2017 High Dimensional Dynamics of Gen Error for NNs" for a modern take on this link. Keeping in mind this link, we focus on fixed-size networks.

convergence rate, different frequency, learned function, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)

Neural Information Processing SystemsJan-23-2025, 23:49:43 GMT

Reviews: The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

It finds that lower frequencies learn first, and finds that biases allow for learning of odd frequencies. The restriction to spherical data is limiting, but the analysis and conclusions (particularly the rates of convergence) are novel and interesting.

convergence rate, different frequency, learned function, (1 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.83)

Neural Information Processing SystemsOct-10-2024, 02:29:22 GMT

The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

convergence rate, different frequency, learned function, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Wang, Yixuan, Siegel, Jonathan W., Liu, Ziming, Hou, Thomas Y.

On the expressiveness and spectral bias of KANs

arXiv.org Artificial IntelligenceOct-2-2024

Kolmogorov-Arnold Networks (KAN) \cite{liu2024kan} were very recently proposed as a potential alternative to the prevalent architectural backbone of many deep learning models, the multi-layer perceptron (MLP). KANs have seen success in various tasks of AI for science, with their empirical efficiency and accuracy demostrated in function regression, PDE solving, and many more scientific problems. In this article, we revisit the comparison of KANs and MLPs, with emphasis on a theoretical perspective. On the one hand, we compare the representation and approximation capabilities of KANs and MLPs. We establish that MLPs can be represented using KANs of a comparable size. This shows that the approximation and representation capabilities of KANs are at least as good as MLPs. Conversely, we show that KANs can be represented using MLPs, but that in this representation the number of parameters increases by a factor of the KAN grid size. This suggests that KANs with a large grid size may be more efficient than MLPs at approximating certain functions. On the other hand, from the perspective of learning and optimization, we study the spectral bias of KANs compared with MLPs. We demonstrate that KANs are less biased toward low frequencies than MLPs. We highlight that the multi-level learning feature specific to KANs, i.e. grid extension of splines, improves the learning process for high-frequency components. Detailed comparisons with different choices of depth, width, and grid sizes of KANs are made, shedding some light on how to choose the hyperparameters in practice.

artificial intelligence, arxiv preprint arxiv, machine learning, (16 more...)

2410.01803

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Texas (0.04)
(3 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJul-18-2024

Not All Frequencies Are Created Equal:Towards a Dynamic Fusion of Frequencies in Time-Series Forecasting

Zhang, Xingyu, Zhao, Siyu, Song, Zeen, Guo, Huijie, Zhang, Jianqi, Zheng, Changwen, Qiang, Wenwen

Long-term time series forecasting is a long-standing challenge in various applications. A central issue in time series forecasting is that methods should expressively capture long-term dependency. Furthermore, time series forecasting methods should be flexible when applied to different scenarios. Although Fourier analysis offers an alternative to effectively capture reusable and periodic patterns to achieve long-term forecasting in different scenarios, existing methods often assume high-frequency components represent noise and should be discarded in time series forecasting. However, we conduct a series of motivation experiments and discover that the role of certain frequencies varies depending on the scenarios. In some scenarios, removing high-frequency components from the original time series can improve the forecasting performance, while in others scenarios, removing them is harmful to forecasting performance. Therefore, it is necessary to treat the frequencies differently according to specific scenarios. To achieve this, we first reformulate the time series forecasting problem as learning a transfer function of each frequency in the Fourier domain. Further, we design Frequency Dynamic Fusion (FreDF), which individually predicts each Fourier component, and dynamically fuses the output of different frequencies. Moreover, we provide a novel insight into the generalization ability of time series forecasting and propose the generalization bound of time series forecasting. Then we prove FreDF has a lower bound, indicating that FreDF has better generalization ability. Extensive experiments conducted on multiple benchmark datasets and ablation studies demonstrate the effectiveness of FreDF.

dataset, forecasting, frequency, (13 more...)

2407.12415

Country:

Oceania > Australia > Victoria > Melbourne (0.05)
Asia > China > Beijing > Beijing (0.05)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Energy (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

arXiv.org Artificial IntelligenceApr-25-2024

Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks

Hu, Yaqi, Yin, Mingsheng, Mezzavilla, Marco, Guo, Hao, Rangan, Sundeep

The upper mid-band (FR3) has been recently attracting interest for new generation of mobile networks, as it provides a promising balance between spectrum availability and coverage, which are inherent limitations of the sub 6GHz and millimeter wave bands, respectively. In order to efficiently design and optimize the network, channel modeling plays a key role since FR3 systems are expected to operate at multiple frequency bands. Data-driven methods, especially generative adversarial networks (GANs), can capture the intricate relationships among data samples, and provide an appropriate tool for FR3 channel modeling. In this work, we present the architecture, link state model, and path generative network of GAN-based FR3 channel modeling. The comparison of our model greatly matches the ray-tracing simulated data.

channel modeling, frequency, vector, (10 more...)

2404.17069

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York > Kings County > New York City (0.04)
North America > United States > California > Monterey County > Pacific Grove (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.61)