AITopics | non-linear operator

Collaborating Authors

non-linear operator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

474815daf1d4096ff78b7e4fdd2086a5-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 16:55:23 GMT

commute, non-linear operator, operator, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > Singapore (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

On Non-Linear operators for Geometric Deep Learning

Neural Information Processing SystemsDec-24-2025, 03:27:31 GMT

This work studies operators mapping vector and scalar fields defined over a manifold $\mathcal{M}$, and which commute with its group of diffeomorphisms $\text{Diff}(\mathcal{M})$. We prove that in the case of scalar fields $L^p_\omega(\mathcal{M,\mathbb{R}})$, those operators correspond to point-wise non-linearities, recovering and extending known results on $\mathbb{R}^d$. In the context of Neural Networks defined over $\mathcal{M}$, it indicates that point-wise non-linear operators are the only universal family that commutes with any group of symmetries, and justifies their systematic use in combination with dedicated linear operators commuting with specific symmetries. In the case of vector fields $L^p_\omega(\mathcal{M},T\mathcal{M})$, we show that those operators are solely the scalar multiplication. It indicates that $\text{Diff}(\mathcal{M})$ is too rich and that there is no universal class of non-linear operators to motivate the design of Neural Networks over the symmetries of $\mathcal{M}$.

mathcal, non-linear operator, operator, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)

Add feedback

474815daf1d4096ff78b7e4fdd2086a5-Paper-Conference.pdf

Neural Information Processing SystemsAug-14-2025, 14:19:27 GMT

commute, non-linear operator, operator, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > Singapore (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

On Non-Linear operators for Geometric Deep Learning

Neural Information Processing SystemsOct-10-2024, 21:38:57 GMT

This work studies operators mapping vector and scalar fields defined over a manifold \mathcal{M}, and which commute with its group of diffeomorphisms \text{Diff}(\mathcal{M}) . We prove that in the case of scalar fields L p_\omega(\mathcal{M,\mathbb{R}}), those operators correspond to point-wise non-linearities, recovering and extending known results on \mathbb{R} d . In the context of Neural Networks defined over \mathcal{M}, it indicates that point-wise non-linear operators are the only universal family that commutes with any group of symmetries, and justifies their systematic use in combination with dedicated linear operators commuting with specific symmetries. In the case of vector fields L p_\omega(\mathcal{M},T\mathcal{M}), we show that those operators are solely the scalar multiplication. It indicates that \text{Diff}(\mathcal{M}) is too rich and that there is no universal class of non-linear operators to motivate the design of Neural Networks over the symmetries of \mathcal{M} .

geometric deep learning, mathcal, operator, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models

Hu, Xing, Cheng, Yuan, Yang, Dawei, Yuan, Zhihang, Yu, Jiangyong, Xu, Chen, Zhou, Sifan

arXiv.org Artificial IntelligenceJun-5-2024

Post-training quantization (PTQ) serves as a potent technique to accelerate the inference of large language models (LLMs). Nonetheless, existing works still necessitate a considerable number of floating-point (FP) operations during inference, including additional quantization and de-quantization, as well as non-linear operators such as RMSNorm and Softmax. This limitation hinders the deployment of LLMs on the edge and cloud devices. In this paper, we identify the primary obstacle to integer-only quantization for LLMs lies in the large fluctuation of activations across channels and tokens in both linear and non-linear operations. To address this issue, we propose I-LLM, a novel integer-only fully-quantized PTQ framework tailored for LLMs. Specifically, (1) we develop Fully-Smooth Block-Reconstruction (FSBR) to aggressively smooth inter-channel variations of all activations and weights. (2) to alleviate degradation caused by inter-token variations, we introduce a novel approach called Dynamic Integer-only MatMul (DI-MatMul). This method enables dynamic quantization in full-integer matrix multiplication by dynamically quantizing the input and outputs with integer-only operations. (3) we design DI-ClippedSoftmax, DI-Exp, and DI-Normalization, which utilize bit shift to execute non-linear operators efficiently while maintaining accuracy. The experiment shows that our I-LLM achieves comparable accuracy to the FP baseline and outperforms non-integer quantization methods. For example, I-LLM can operate at W4A4 with negligible loss of accuracy. To our knowledge, we are the first to bridge the gap between integer-only quantization and LLMs. We've published our code on anonymous.4open.science, aiming to contribute to the advancement of this field.

arxiv preprint arxiv, opération, quantization, (14 more...)

arXiv.org Artificial Intelligence

2405.17849

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

Xi, Haocheng, Chen, Yuxiang, Zhao, Kang, Zheng, Kaijun, Chen, Jianfei, Zhu, Jun

arXiv.org Artificial IntelligenceMar-19-2024

Pretraining transformers are generally time-consuming. Fully quantized training (FQT) is a promising approach to speed up pretraining. However, most FQT methods adopt a quantize-compute-dequantize procedure, which often leads to suboptimal speedup and significant performance degradation when used in transformers due to the high memory access overheads and low-precision computations. In this work, we propose Jetfire, an efficient and accurate INT8 training method specific to transformers. Our method features an INT8 data flow to optimize memory access and a per-block quantization method to maintain the accuracy of pretrained transformers. Extensive experiments demonstrate that our INT8 FQT method achieves comparable accuracy to the FP16 training baseline and outperforms the existing INT8 training works for transformers. Moreover, for a standard transformer block, our method offers an end-to-end training speedup of 1.42x and a 1.49x memory reduction compared to the FP16 baseline.

operator, quantization, submission and formatting instruction, (13 more...)

arXiv.org Artificial Intelligence

2403.12422

Genre: Research Report > Promising Solution (0.34)

Technology: