AITopics | activation probability

activation probability

Learning Influence Functions from Incomplete Observations

Xinran He, Ke Xu, David Kempe, Yan Liu

Neural Information Processing Systems, Mar-23-2026, 11:38:08 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, social media, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

4ee78d4122ef8503fe01cdad3e9ea4ee-Supplemental.pdf

Neural Information Processing Systems, Feb-8-2026, 09:46:35 GMT

corr, node, type 2, (17 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.05)
North America > United States > California > Monterey County > Monterey (0.04)
North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

4ee78d4122ef8503fe01cdad3e9ea4ee-AuthorFeedback.pdf

Neural Information Processing Systems, Feb-8-2026, 09:46:18 GMT

broader impact, corr, extension, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications (0.49)
Information Technology > Artificial Intelligence (0.31)

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Zhang, Shimao, Lai, Zhejian, Liu, Xiang, She, Shuaijie, Liu, Xiao, Gong, Yeyun, Huang, Shujian, Chen, Jiajun

arXiv.org Artificial Intelligence, Nov-25-2025

Multilingual alignment is an effective and representative paradigm for enhancing LLMs' multilingual capabilities: it transfers capabilities from high-resource languages to low-resource languages. Meanwhile, research on language-specific neurons provides a new perspective for analyzing and understanding LLMs' mechanisms. However, we find that many neurons are shared by multiple but not all languages and cannot be correctly classified. In this work, we propose a ternary classification methodology that categorizes neurons into three types: language-specific neurons, language-related neurons, and general neurons. We also propose a corresponding identification algorithm to distinguish these types of neurons. Furthermore, based on the distributional characteristics of the different types of neurons, we divide the LLMs' internal process for multilingual inference into four parts: (1) multilingual understanding, (2) shared semantic space reasoning, (3) multilingual output space transformation, and (4) vocabulary space outputting. Additionally, we systematically analyze the models before and after alignment, with a focus on the different types of neurons. We also analyze the phenomenon of "Spontaneous Multilingual Alignment". Overall, our work conducts a comprehensive investigation based on different types of neurons, providing empirical results and valuable insights to better understand the multilingual alignment and multilingual capabilities of LLMs.
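
As a toy illustration of the three-way split described in this abstract, the sketch below labels neurons from a boolean activity matrix. It is not the paper's identification algorithm, which the abstract does not specify. The function name `classify_neurons`, the input format, and the rule "exactly one language means language-specific" are assumptions made for illustration only.

```python
import numpy as np

def classify_neurons(active: np.ndarray) -> list[str]:
    """Label neurons from a boolean (neurons x languages) matrix.

    active[i, j] is True when neuron i is considered tied to language j.
    How that matrix is obtained is outside this sketch (assumption).
    """
    n_languages = active.shape[1]
    counts = active.sum(axis=1)  # number of languages per neuron
    labels = []
    for c in counts:
        if c == n_languages:
            labels.append("general")            # shared by all languages
        elif c >= 2:
            labels.append("language-related")   # multiple but not all
        elif c == 1:
            labels.append("language-specific")  # assumed: exactly one
        else:
            labels.append("inactive")           # tied to no language
    return labels
```

With three languages, a neuron tied to two of them falls into the "language-related" group, which is the case the abstract says a binary specific/general split cannot place.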

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2505.21505

Country:

Asia (0.68)
North America > United States (0.28)
Europe > Austria (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback

Zheng Wen, Branislav Kveton, Michal Valko, Sharan Vaswani

Neural Information Processing Systems, Nov-21-2025, 10:57:17 GMT

We study the online influence maximization problem in social networks under the independent cascade model. Specifically, we aim to learn the set of "best
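
For readers unfamiliar with the independent cascade model named in this entry, here is a minimal simulation sketch. It shows only the standard diffusion process, in which each newly activated node gets one chance to activate each inactive neighbor with that edge's activation probability. It does not implement the paper's online learning method. The graph format and the names `simulate_independent_cascade` and `estimate_spread` are assumptions.

```python
import random

def simulate_independent_cascade(graph, seeds, rng=None):
    """Run one cascade and return the set of activated nodes.

    graph maps node -> {neighbor: activation probability} (assumed format).
    """
    rng = rng or random.Random()
    active = set(seeds)
    frontier = list(active)
    while frontier:
        next_frontier = []
        for u in frontier:
            for v, p in graph.get(u, {}).items():
                # u gets a single attempt to activate each inactive neighbor v.
                if v not in active and rng.random() < p:
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return active

def estimate_spread(graph, seeds, n_runs=1000, seed=0):
    """Monte Carlo estimate of the expected number of activated nodes."""
    rng = random.Random(seed)
    total = sum(
        len(simulate_independent_cascade(graph, seeds, rng))
        for _ in range(n_runs)
    )
    return total / n_runs
```

Influence maximization then asks which seed set maximizes the quantity that `estimate_spread` approximates. In the online setting the edge probabilities are unknown and must be learned from observed cascades.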

data mining, machine learning, node, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada > British Columbia (0.04)
(2 more...)

Industry: Information Technology > Services (0.36)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.47)

4ee78d4122ef8503fe01cdad3e9ea4ee-Supplemental.pdf

Neural Information Processing Systems, Oct-2-2025, 21:47:56 GMT

artificial intelligence, corr, node, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Reviewer

Neural Information Processing Systems, Oct-2-2025, 21:47:37 GMT

We thank all the reviewers for the detailed perusal and valuable suggestions. Please find our responses below. Further advantages follow from Corollaries 1-3. We will be happy to cite more works, including those suggested. Beyond qualitative statement through plots: The POC in Sec 4.2 provides a technical treatment to compare As the comparison is graph-dependent, a completely general conclusion cannot be drawn.

artificial intelligence, corr, extension, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications (0.49)
Information Technology > Artificial Intelligence (0.31)

Deep Learning Optimization of Two-State Pinching Antennas Systems

Karagiannidis, Odysseas G., Galanopoulou, Victoria E., Diamantoulakis, Panagiotis D., Ding, Zhiguo, Dobre, Octavia

arXiv.org Artificial Intelligence, Jul-9-2025

The evolution of wireless communication systems requires flexible, energy-efficient, and cost-effective antenna technologies. Pinching antennas (PAs), which can dynamically control electromagnetic wave propagation through binary activation states, have recently emerged as a promising candidate. In this work, we investigate the problem of optimally selecting a subset of fixed-position PAs to activate in a waveguide, when the aim is to maximize the communication rate at a user terminal. Due to the complex interplay between antenna activation, waveguide-induced phase shifts, and power division, this problem is formulated as a combinatorial fractional 0-1 quadratic program. To efficiently solve this challenging problem, we use neural network architectures of varying complexity to learn activation policies directly from data, leveraging spatial features and signal structure. Furthermore, we incorporate user location uncertainty into our training and evaluation pipeline to simulate realistic deployment conditions. Simulation results demonstrate the effectiveness and robustness of the proposed models.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.06222

Country: Europe (0.28)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.83)

Analytic theory of dropout regularization

Mori, Francesco, Mignacco, Francesca

arXiv.org Machine Learning, May-13-2025

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite its widespread adoption, dropout probabilities are often selected heuristically, and theoretical explanations of its success remain sparse. Here, we analytically study dropout in two-layer neural networks trained with online stochastic gradient descent. In the high-dimensional limit, we derive a set of ordinary differential equations that fully characterize the evolution of the network during training and capture the effects of dropout. We obtain a number of exact results describing the generalization error and the optimal dropout probability at short, intermediate, and long training times. Our analysis shows that dropout reduces detrimental correlations between hidden nodes, mitigates the impact of label noise, and that the optimal dropout probability increases with the level of noise in the data. Our results are validated by extensive numerical simulations.
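
The mechanism this abstract refers to, randomly deactivating units during training, can be sketched in a few lines. This is the common "inverted dropout" formulation, not the paper's analytical setup. The parameter name `keep_prob` (the probability that a unit stays active) and the rescaling by `1 / keep_prob` are assumptions made for illustration.

```python
import numpy as np

def dropout(h, keep_prob, rng):
    """Inverted dropout on an array of hidden activations.

    Each unit is kept with probability keep_prob and zeroed otherwise.
    Kept units are rescaled so the expected activation is unchanged.
    """
    if not 0.0 < keep_prob <= 1.0:
        raise ValueError("keep_prob must be in (0, 1]")
    mask = rng.random(h.shape) < keep_prob  # True where the unit stays active
    return h * mask / keep_prob
```

Calling `dropout(h, 0.5, np.random.default_rng(0))` zeroes roughly half of the entries of `h` and doubles the rest. The dropout probability discussed in the abstract corresponds to `1 - keep_prob` in this sketch.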

artificial intelligence, dropout, machine learning, (17 more...)

arXiv.org Machine Learning

2505.07792

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(2 more...)

Genre: Research Report > New Finding (0.66)

Industry: Education (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

An Information-Theoretic Approach to Identifying Formulaic Clusters in Textual Data

Yoffe, Gideon, Segev, Yair, Sober, Barak

arXiv.org Artificial Intelligence, Mar-10-2025

Texts, whether literary or historical, exhibit structural and stylistic patterns shaped by their purpose, authorship, and cultural context. Formulaic texts, characterized by repetition and constrained expression, tend to have lower variability in self-information compared to more dynamic compositions. Identifying such patterns in historical documents, particularly multi-author texts like the Hebrew Bible, provides insights into their origins, purpose, and transmission. This study aims to identify formulaic clusters (sections exhibiting systematic repetition and structural constraints) by analyzing recurring phrases, syntactic structures, and stylistic markers. However, distinguishing formulaic from non-formulaic elements in an unsupervised manner presents a computational challenge, especially in high-dimensional textual spaces where patterns must be inferred without predefined labels. To address this, we develop an information-theoretic algorithm that leverages weighted self-information distributions to detect structured patterns in text. Unlike covariance-based methods, which become unstable in small-sample, high-dimensional settings, our approach directly models variations in self-information to identify formulaicity. By extending classical discrete self-information measures with a continuous formulation based on differential self-information, our method remains applicable across different types of textual representations, including neural embeddings under Gaussian priors. Applied to hypothesized authorial divisions in the Hebrew Bible, our approach successfully isolates stylistic layers, providing a quantitative framework for textual stratification. This method enhances our ability to analyze compositional patterns, offering deeper insights into the literary and cultural evolution of texts shaped by complex authorship and editorial processes.
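
The claim that formulaic text has lower variability in self-information can be made concrete with the classical discrete measure, I(x) = -log2 p(x). The sketch below estimates p from token frequencies within a passage and reports the variance of the per-token values. It is a simplified illustration, not the weighted or differential formulation the abstract describes. The function names are assumptions.

```python
import math
from collections import Counter

def self_information(tokens):
    """Per-token self-information -log2 p(token), with p estimated
    from the token frequencies of the passage itself."""
    counts = Counter(tokens)
    total = len(tokens)
    return [-math.log2(counts[t] / total) for t in tokens]

def variance(values):
    """Population variance of a non-empty list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)
```

A fully repetitive passage such as `["amen"] * 8` gives zero variance. A passage that mixes frequent and rare tokens gives a positive variance, which is the contrast the abstract relies on.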

dimension, formulaic cluster, probability, (15 more...)

arXiv.org Artificial Intelligence

2503.07303

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
North America > United States > New York (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
