AITopics

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsMar-16-2026, 18:58:10 GMT

Exponentiated Strongly Rayleigh Distributions

Strongly Rayleigh (SR) measures are discrete probability distributions over the subsets of a ground set. They enjoy strong negative dependence properties, as a result of which they assign higher probability to subsets of diverse elements. We introduce in this paper Exponentiated Strongly Rayleigh (ESR) measures, which sharpen (or smoothen) the negative dependence property of SR measures via a single parameter (the exponent) that can intuitively understood as an inverse temperature. We develop efficient MCMC procedures for approximate sampling from ESRs, and obtain explicit mixing time bounds for two concrete instances: exponentiated versions of Determinantal Point Processes and Dual Volume Sampling. We illustrate some of the potential of ESRs, by applying them to a few machine learning tasks; empirical results confirm that beyond their theoretical appeal, ESR-based models hold significant promise for these tasks.

artificial intelligence, machine learning, proceedings, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsFeb-7-2026, 14:49:36 GMT

1730f69e6f66d5f0c741799e82351f81-Supplemental.pdf

cascade, diffusion model, infection probability, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Gafos, Adamantios I., Kuberski, Stephan R.

Limit cycles for speech

arXiv.org Artificial IntelligenceDec-5-2025

Rhythmic fluctuations in acoustic energy and accompanying neuronal excitations in cortical oscillations are characteristic of human speech, yet whether a corresponding rhythmicity inheres in the articulatory movements that generate speech remains unclear. The received understanding of speech movements as discrete, goal-oriented actions struggles to make contact with the rhythmicity findings. In this work, we demonstrate that an unintuitive -- but no less principled than the conventional -- representation for discrete movements reveals a pervasive limit cycle organization and unlocks the recovery of previously inaccessible rhythmic structure underlying the motor activity of speech. These results help resolve a time-honored tension between the ubiquity of biological rhythmicity and discreteness in speech, the quintessential human higher function, by revealing a rhythmic organization at the most fundamental level of individual articulatory actions.

artificial intelligence, oscillator, speech, (15 more...)

2512.04642

Country: Europe > Germany (0.29)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology: Information Technology > Artificial Intelligence (1.00)

arXiv.org Artificial IntelligenceNov-20-2025

Joint Semantic-Channel Coding and Modulation for Token Communications

Ying, Jingkai, Qin, Zhijin, Feng, Yulong, Wang, Liejun, Tao, Xiaoming

In recent years, the Transformer architecture has achieved outstanding performance across a wide range of tasks and modalities. Token is the unified input and output representation in Transformer-based models, which has become a fundamental information unit. In this work, we consider the problem of token communication, studying how to transmit tokens efficiently and reliably. Point cloud, a prevailing three-dimensional format which exhibits a more complex spatial structure compared to image or video, is chosen to be the information source. We utilize the set abstraction method to obtain point tokens. Subsequently, to get a more informative and transmission-friendly representation based on tokens, we propose a joint semantic-channel and modulation (JSCCM) scheme for the token encoder, mapping point tokens to standard digital constellation points (modulated tokens). Specifically, the JSCCM consists of two parallel Point Transformer-based encoders and a differential modulator which combines the Gumel-softmax and soft quantization methods. Besides, the rate allocator and channel adapter are developed, facilitating adaptive generation of high-quality modulated tokens conditioned on both semantic information and channel conditions. Extensive simulations demonstrate that the proposed method outperforms both joint semantic-channel coding and traditional separate coding, achieving over 1dB gain in reconstruction and more than 6x compression ratio in modulated symbols.

large language model, machine learning, natural language, (18 more...)

2511.15699

Country:

Asia (1.00)
Europe (0.93)
North America > United States (0.68)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Giwa, Oluwaseyi, Mohsin, Muhammad Ahmed, Adesola, Folarin Jubril, Jamshed, Muhammad Ali

QPPG: Quantum-Preconditioned Policy Gradient for Link Adaptation in Rayleigh Fading Channels

arXiv.org Artificial IntelligenceOct-23-2025

IRELESS communication over fading channels remains one of the fundamental challenges in modern networks. In particular, Rayleigh fading channels, which model rich-scattering non-line-of-sight environments, cause rapid and unpredictable fluctuations in signal strength that can significantly degrade throughput and reliability. To mitigate these effects, link adaptation techniques such as adaptive modulation and coding (AMC) and power control have been extensively studied as key enablers of efficient spectrum use [1], [2]. Early works on link adaptation for Rayleigh fading channels demonstrated how explicit channel estimation and threshold-based switching could improve throughput and maintain robustness under fading conditions [3]-[6]. Despite their success, these classical approaches rely on accurate channel estimation, fixed rules, and often compromise between average throughput and outage probability in a suboptimal manner [4]-[6]. Furthermore, as networks evolve toward 6G with denser topologies and stringent reliability demands, such schemes struggle to scale or adapt to system-level complexities [7], [8]. Recent works have explored deep reinforcement learning (DRL) and meta reinforcement learning (RL) for link adaptation and resource allocation, showing promising adaptability but still facing high sample complexity and training instability [9]-[12]. In this letter, we propose quantum-preconditioned policy gradient (QPPG), a natural actor-critic method for link adap-Oluwaseyi Giwa is with the African Institute for Mathematical Sciences, South Africa (e-mail: {oluwaseyi}@aims.ac.za). Muhammad Ahmed Mohsin is with Stanford University, Stanford, California, 94305, United States (e-mail: {muahmed}@stanford.edu).

link adaptation, machine learning, reinforcement learning, (11 more...)

2506.15753

Country:

North America > United States > California > Santa Clara County > Stanford (0.24)
North America > United States > California > Santa Clara County > Palo Alto (0.24)

Genre: Research Report (0.40)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Neural Information Processing SystemsOct-2-2025, 06:02:28 GMT

1730f69e6f66d5f0c741799e82351f81-Supplemental.pdf

artificial intelligence, infection probability, machine learning, (18 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceMay-27-2025

ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications

Wu, Tong, Chen, Zhiyong, He, Dazhi, Yang, Feng, Tao, Meixia, Xu, Xiaodong, Zhang, Wenjun, Zhang, Ping

--Diffusion models (DMs) have recently achieved significant success in wireless communications systems due to their denoising capabilities. The broadcast nature of wireless signals makes them susceptible not only to Gaussian noise, but also to unaware interference. This raises the question of whether DMs can effectively mitigate interference in wireless semantic communication systems. In this paper, we model the interference cancellation problem as a maximum a posteriori (MAP) problem over the joint posterior probability of the signal and interference, and theoretically prove that the solution provides excellent estimates for the signal and interference. T o solve this problem, we develop an interference cancellation diffusion model (ICDM), which decomposes the joint posterior into independent prior probabilities of the signal and interference, along with the channel transition probablity. The log-gradients of these distributions at each time step are learned separately by DMs and accurately estimated through deriving. ICDM further integrates these gradients with advanced numerical iteration method, achieving accurate and rapid interference cancellation. Extensive experiments demonstrate that ICDM significantly reduces the mean square error (MSE) and enhances perceptual quality compared to schemes without ICDM. For example, on the CelebA dataset under the Rayleigh fading channel with a signal-to-noise ratio (SNR) of 20 dB and signal to interference plus noise ratio (SINR) of 0 dB, ICDM reduces the MSE by 4 . Diffusion models (DMs) [1]-[4], which utilize a score function to model the gradient of the conditional data distribution, have recently achieved remarkable success in the field of artificial intelligence generated content (AIGC). These models are capable of generating controllable and high-quality content in various domains, including long-form text generation, controllable image generation, and consistent video generation. They have also become a fundamental technology for large language models (LLMs) such as GPT -4o. The inherent controllability of the content generated by diffusion models has significantly driven their application across diverse fields [5]-[7].

large language model, machine learning, natural language, (21 more...)

2505.19983

Country: Asia > China (0.46)

Genre: Research Report (0.64)

Yan, Yongli, Dai, Linglong

Decoding for Punctured Convolutional and Turbo Codes: A Deep Learning Solution for Protocols Compliance

arXiv.org Artificial IntelligenceFeb-21-2025

Neural network-based decoding methods have shown promise in enhancing error correction performance, but traditional approaches struggle with the challenges posed by punctured codes. In particular, these methods fail to address the complexities of variable code rates and the need for protocol compatibility. This paper presents a unified Long Short-Term Memory (LSTM)-based decoding architecture specifically designed to overcome these challenges. The proposed method unifies punctured convolutional and Turbo codes. A puncture embedding mechanism integrates puncturing patterns directly into the network, enabling seamless adaptation to varying code rates, while balanced bit error rate training ensures robustness across different code lengths, rates, and channels, maintaining protocol flexibility. Extensive simulations in Additive White Gaussian Noise and Rayleigh fading channels demonstrate that the proposed approach outperforms conventional decoding techniques, providing significant improvements in decoding accuracy and robustness. These results underscore the potential of LSTM-based decoding as a promising solution for next-generation artificial intelligence powered communication systems.

code rate, convolutional neural engine, decoder, (12 more...)

2502.15475

Country: Asia > Malaysia > Kuala Lumpur > Kuala Lumpur (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Education (0.76)
Information Technology (0.68)
Telecommunications (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Radwan, Ahmed Y., Shehab, Mohammad, Alouini, Mohamed-Slim

TinyML NLP Approach for Semantic Wireless Sentiment Classification

arXiv.org Artificial IntelligenceNov-9-2024

Natural Language Processing (NLP) operations, such as semantic sentiment analysis and text synthesis, may often impair users' privacy and demand significant on device computational resources. Centralized learning (CL) on the edge offers an alternative energy-efficient approach, yet requires the collection of raw information, which affects the user's privacy. While Federated learning (FL) preserves privacy, it requires high computational energy on board tiny user devices. We introduce split learning (SL) as an energy-efficient alternative, privacy-preserving tiny machine learning (TinyML) scheme and compare it to FL and CL in the presence of Rayleigh fading and additive noise. Our results show that SL reduces processing power and CO2 emissions while maintaining high accuracy, whereas FL offers a balanced compromise between efficiency and privacy. Hence, this study provides insights into deploying energy-efficient, privacy-preserving NLP models on edge devices.

large language model, machine learning, natural language, (20 more...)

2411.06291

Country:

North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report > New Finding (0.54)

Industry:

Information Technology > Security & Privacy (0.96)
Energy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.71)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.71)