AITopics | matrix exponential

958c530554f78bcd8e97125b70e6973d-Supplemental.pdf

Neural Information Processing Systems | Feb-19-2026, 06:23:33 GMT

matrix, stationary distribution, transition matrix, (13 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England (0.04)
Asia > Middle East > Israel (0.04)
Asia > Japan (0.04)
(4 more...)

Genre: Workflow (0.46)

Industry:

Government (1.00)
Health & Medicine (0.68)
Banking & Finance (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Shaping Social Activity by Incentivizing Users

Mehrdad Farajtabar, Nan Du, Manuel Gomez Rodriguez, Isabel Valera, Hongyuan Zha, Le Song

Neural Information Processing Systems | Oct-2-2025, 21:57:22 GMT

Events in an online social network can be categorized roughly into endogenous events, where users just respond to the actions of their neighbors within the network, or exogenous events, where users take actions due to drives external to the network. How much external drive should be provided to each user, such that the network activity can be steered towards a target state? In this paper, we model social events using multivariate Hawkes processes, which can capture both endogenous and exogenous event intensities, and derive a time dependent linear relation between the intensity of exogenous events and the overall network activity. Exploiting this connection, we develop a convex optimization framework for determining the required level of external drive in order for the network to reach a desired activity level. We experimented with event data gathered from Twitter, and show that our method can steer the activity of the network more accurately than alternatives.

event intensity, hawkes process, intensity, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Spain > Galicia > Madrid (0.04)
North America > United States > Hawaii (0.04)
(2 more...)

Industry:

Telecommunications > Networks (0.55)
Information Technology > Networks (0.55)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Efficient Learning of Continuous-Time Hidden Markov Models for Disease Progression

Yu-Ying Liu, Shuang Li, Fuxin Li, Le Song, James M. Rehg

Neural Information Processing Systems | Oct-2-2025, 12:03:04 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, ct-hmm, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Georgia > Fulton County > Atlanta (0.04)

Genre: Research Report (0.47)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

d3f06eef2ffac7faadbe3055a70682ac-Paper.pdf

Neural Information Processing Systems | Aug-16-2025, 14:53:34 GMT

convolution, convolution exponential, transformation, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Europe > Netherlands > North Holland > Amsterdam (0.05)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

A Additional details regarding

Neural Information Processing Systems | Aug-16-2025, 03:58:49 GMT

GANs trained by a two time-scale update rule converge to a local Nash equilibrium.

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England (0.04)
Asia > Middle East > Israel (0.04)
Asia > Japan (0.04)
(4 more...)

Genre: Workflow (0.46)

Industry:

Government (1.00)
Health & Medicine (0.68)
Banking & Finance (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Rethinking RoPE: A Mathematical Blueprint for N-dimensional Positional Embedding

Liu, Haiping, Lin, Lijing, Sun, Jingyuan, Shangguan, Zhegong, Alvarez, Mauricio A., Zhou, Hongpeng

arXiv.org Artificial Intelligence | Jul-16-2025

Rotary Position Embedding (RoPE) is widely adopted in large language models (LLMs) due to its efficient encoding of relative positions with strong extrapolation capabilities. However, while its application in higher-dimensional input domains, such as 2D images, has been explored in several attempts, a unified theoretical framework is still lacking. To address this, we propose a systematic mathematical framework for RoPE grounded in Lie group and Lie algebra theory. We derive the necessary and sufficient conditions for any valid $N$-dimensional RoPE based on two core properties of RoPE: relativity and reversibility. We demonstrate that RoPE can be characterized as a basis of a maximal abelian subalgebra (MASA) in the special orthogonal Lie algebra, and that the commonly used axis-aligned block-diagonal RoPE, where each input axis is encoded by an independent 2x2 rotation block, corresponds to the maximal toral subalgebra. Furthermore, we reduce spatial inter-dimensional interactions to a change of basis, resolved by learning an orthogonal transformation. Our experimental results suggest that inter-dimensional interactions should be balanced with local structure preservation. Overall, our framework unifies and explains existing RoPE designs while enabling principled extensions to higher-dimensional modalities and tasks.
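
As an illustration of the relativity property mentioned in the abstract, the following minimal sketch uses a single 2x2 rotation block, which is the matrix exponential of a skew-symmetric generator scaled by the position. The function names (`rotate`, `score`) and the `freq` parameter are hypothetical and chosen only for this example; the sketch does not reproduce the paper's N-dimensional construction.

```python
import math


def rotate(vec, pos, freq=1.0):
    """Rotate a 2-vector by the angle pos * freq.

    The rotation matrix equals exp(pos * freq * J) with J = [[0, -1], [1, 0]].
    """
    angle = pos * freq
    c, s = math.cos(angle), math.sin(angle)
    x, y = vec
    return (c * x - s * y, s * x + c * y)


def score(q, k, m, n, freq=1.0):
    """Inner product of a query at position m and a key at position n."""
    qm = rotate(q, m, freq)
    kn = rotate(k, n, freq)
    return qm[0] * kn[0] + qm[1] * kn[1]


# Hypothetical query/key vectors; shifting both positions by the same
# offset should leave the score unchanged up to rounding error.
q, k = (0.3, -1.2), (0.8, 0.5)
print(score(q, k, 3, 7), score(q, k, 13, 17))
```

Because the two rotations compose into a single rotation by the position difference, the score depends only on n - m, which is what relativity requires.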

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2504.06308

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Discretization of Linear Systems using the Matrix Exponential

Dahdah, Steven, Forbes, James Richard

arXiv.org Artificial Intelligence | May-27-2025

Discretizing continuous-time linear systems typically requires numerical integration. This document presents a convenient method for discretizing the dynamics, input, and process noise state-space matrices of a continuous-time linear system using a single matrix exponential.
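
A minimal sketch of the general idea, assuming a zero-order hold on the input: exponentiating the augmented matrix [[A, B], [0, 0]] * dt yields the discrete-time dynamics and input matrices in its top block row. This covers only the dynamics and input matrices, not process noise, and it is not necessarily the exact construction used in the paper. The helper names (`mat_mul`, `expm`, `discretize`) are hypothetical, and the naive `expm` below stands in for a library routine such as `scipy.linalg.expm`.

```python
import math


def mat_mul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def expm(m, terms=20):
    """Naive matrix exponential: scaling and squaring with a Taylor series."""
    n = len(m)
    norm = max(sum(abs(x) for x in row) for row in m)
    # Halve the matrix until its norm is at most 0.5 so the series converges fast.
    squarings = max(0, math.ceil(math.log2(norm)) + 1) if norm > 0 else 0
    scaled = [[x / 2.0 ** squarings for x in row] for row in m]
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[x / k for x in row] for row in mat_mul(term, scaled)]
        result = [[r + t for r, t in zip(rr, tr)]
                  for rr, tr in zip(result, term)]
    # Undo the scaling by repeated squaring.
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def discretize(A, B, dt):
    """Return (Ad, Bd) for x' = A x + B u under a zero-order hold of length dt."""
    n, m = len(A), len(B[0])
    aug = [[0.0] * (n + m) for _ in range(n + m)]
    for i in range(n):
        for j in range(n):
            aug[i][j] = A[i][j] * dt
        for j in range(m):
            aug[i][n + j] = B[i][j] * dt
    E = expm(aug)
    Ad = [row[:n] for row in E[:n]]
    Bd = [row[n:] for row in E[:n]]
    return Ad, Bd


# Double integrator as a small usage example.
Ad, Bd = discretize([[0.0, 1.0], [0.0, 0.0]], [[0.0], [1.0]], 0.5)
print(Ad, Bd)
```

For the double integrator the augmented matrix is nilpotent, so the result can be checked by hand against Ad = [[1, dt], [0, 1]] and Bd = [[dt^2 / 2], [dt]].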

artificial intelligence, discretization, exp, (12 more...)

arXiv.org Artificial Intelligence

2505.18187

Country:

North America > Canada > Quebec > Montreal (0.15)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence (0.37)

Shaping Social Activity by Incentivizing Users

Mehrdad Farajtabar, Nan Du, Manuel Gomez Rodriguez, Isabel Valera, Hongyuan Zha, Le Song

Neural Information Processing Systems | Feb-9-2025, 14:37:12 GMT

Events in an online social network can be categorized roughly into endogenous events, where users just respond to the actions of their neighbors within the network, or exogenous events, where users take actions due to drives external to the network. How much external drive should be provided to each user, such that the network activity can be steered towards a target state? In this paper, we model social events using multivariate Hawkes processes, which can capture both endogenous and exogenous event intensities, and derive a time dependent linear relation between the intensity of exogenous events and the overall network activity. Exploiting this connection, we develop a convex optimization framework for determining the required level of external drive in order for the network to reach a desired activity level. We experimented with event data gathered from Twitter, and show that our method can steer the activity of the network more accurately than alternatives.

artificial intelligence, machine learning, social media, (20 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Spain > Galicia > Madrid (0.04)
North America > United States > Hawaii (0.04)
(2 more...)

Industry:

Telecommunications > Networks (0.55)
Information Technology > Networks (0.55)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function

Gai, Kuo, Zhang, Shihua

arXiv.org Machine Learning | Jul-1-2024

In practice, deeper networks tend to be more powerful than shallow ones, but this has not been understood theoretically. In this paper, we find an analytical solution of a three-layer network with a matrix exponential activation function, i.e., f(X) = W Our proof shows the power of depth and the use of a non-linear activation function, since a one-layer network can only solve one equation, i.e., Y = WX. Deep neural networks have become successful in many fields, including computer vision, natural language processing, bioinformatics, etc. However, the mathematical principle of deep learning is still not fully understood, especially why deeper networks with non-linear activation functions tend to be more powerful than shallower ones. It is well known that sufficiently large depth-2 neural networks with reasonable activation functions can approximate any continuous function on a bounded domain (Cybenko, 1989; Funahashi, 1989; Hornik et al., 1989; Barron, 1994; Pinkus, 1999), but this requires the width of the networks to be exponential. Recent authors have shown that some functions can be approximated by deeper networks with fewer neurons than by shallower ones, such as radial functions (Eldan & Shamir, 2016), Boolean circuits (Rossman et al., 2015) or functions induced by neural networks (Telgarsky, 2016).

activation function, exp, neural network, (10 more...)

arXiv.org Machine Learning

2407.0254

Country:

Asia > Middle East > Jordan (0.05)
Asia > China > Beijing > Beijing (0.04)
Asia > Vietnam > Long An Province > Tân An (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Exponential time differencing for matrix-valued dynamical systems

Shkeir, Nayef, Schäfer, Tobias, Grafke, Tobias

arXiv.org Artificial Intelligence | Jun-19-2024

Matrix evolution equations occur in many applications, such as dynamical Lyapunov/Sylvester systems or Riccati equations in optimization and stochastic control, machine learning or data assimilation. In many cases, their tightest stability condition comes from a linear term. Exponential time differencing (ETD) is known to produce highly stable numerical schemes by treating the linear term exactly. In particular, for stiff problems, ETD methods are often the method of choice. We propose an extension of the class of ETD algorithms to matrix-valued dynamical equations. This allows us to produce highly efficient and stable integration schemes. We show their efficiency and applicability for a variety of real-world problems, from geophysical applications to dynamical problems in machine learning.
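
To make "treating the linear term exactly" concrete, here is a minimal sketch of the standard first-order ETD scheme (ETD1) for a scalar equation du/dt = c*u + F(u). It is the textbook scalar scheme, not the matrix-valued extension proposed in the paper, and the names `etd1_step` and `integrate` are hypothetical.

```python
import math


def etd1_step(u, h, c, nonlinear):
    """One ETD1 step for du/dt = c*u + nonlinear(u), assuming c != 0.

    The linear part is propagated exactly by exp(c*h); the nonlinear
    part is held constant over the step.
    """
    return math.exp(c * h) * u + math.expm1(c * h) / c * nonlinear(u)


def integrate(u0, h, steps, c, nonlinear):
    """Apply ETD1 repeatedly, starting from u0."""
    u = u0
    for _ in range(steps):
        u = etd1_step(u, h, c, nonlinear)
    return u


# Stiff linear term (c = -50) with a constant forcing term as a usage example.
# Explicit Euler with the same step h = 0.1 would be unstable here,
# since |1 + c*h| = 4 > 1.
print(integrate(1.0, 0.1, 10, -50.0, lambda u: 1.0))
```

When the nonlinear term is constant, each ETD1 step reproduces the exact solution, which makes this case a convenient sanity check.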

equation, matrix exponential, representation, (15 more...)

arXiv.org Artificial Intelligence

2406.13761

Country: