Reviews: Discrete Flows: Invertible Generative Models of Discrete Data

Neural Information Processing Systems

Originality: This paper is the first demonstration of flow-based models on discrete data. As such, the work is fairly novel. The flow-based modeling community has been wondering for some time how to model discrete data, and this paper provides an answer to this question. That being said, the main technical contribution amounts to using a modulo operator (Eq. …). I view this simplicity as a benefit of the approach, but some may view it as a simple extension of existing techniques.
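To make the "modulo operator" point concrete, here is a minimal sketch of the kind of invertible discrete transformation the review refers to: an affine map over the integers mod K, which is a bijection on {0, …, K−1} whenever the scale is coprime with K. The vocabulary size `K` and the parameter values are illustrative assumptions, not taken from the paper.

```python
import numpy as np

K = 10  # vocabulary size (assumed for illustration)

def forward(x, mu, sigma):
    # Affine map mod K; invertible when gcd(sigma, K) == 1.
    return (mu + sigma * x) % K

def inverse(y, mu, sigma):
    # Multiplicative inverse of sigma modulo K (Python 3.8+ three-arg pow).
    sigma_inv = pow(int(sigma), -1, K)
    return ((y - mu) * sigma_inv) % K

x = np.arange(K)
y = forward(x, mu=3, sigma=7)
assert np.array_equal(inverse(y, mu=3, sigma=7), x)  # round-trips exactly
```

Because the map is a permutation of the vocabulary, the change-of-variables formula for discrete flows needs no Jacobian term, which is the source of the simplicity the review mentions.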


Reviews: The Thermodynamic Variational Objective

Neural Information Processing Systems

The paper connects variational inference with thermodynamic integration, so that the data log-likelihood can be formulated as a 1D integral of the instantaneous ELBO over the unit interval. By applying a left Riemann sum, the TVO, a novel lower bound on the marginal log-likelihood, is derived; the traditional variational ELBO is recovered when only one partition is used. The authors then design an importance-sampling-based gradient estimator to optimize the objective, and compare with other methods on both discrete and continuous deep generative models. Originality and significance: the formulation of the TVO is an interesting idea. Optimization methods better than the importance-sampling-based approach are worth exploring further.
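The left-Riemann-sum structure described above can be sketched on a toy integrand. Since the instantaneous ELBO is non-decreasing in β, the left Riemann sum lower-bounds the integral, and one partition evaluates the integrand at β = 0 (the standard ELBO). The function `f` below is a stand-in for the instantaneous ELBO curve, not anything from the paper.

```python
import numpy as np

def left_riemann(f, num_partitions):
    # Left Riemann sum of f over [0, 1] with equally spaced partitions.
    betas = np.linspace(0.0, 1.0, num_partitions, endpoint=False)
    return float(np.mean(f(betas)))  # (1/K) * sum_k f(beta_k)

f = lambda b: b ** 2          # toy non-decreasing integrand
true_integral = 1.0 / 3.0     # plays the role of log p(x)

bounds = [left_riemann(f, k) for k in (1, 2, 10, 100)]
# K = 1 evaluates f(0), the analogue of the standard ELBO;
# more partitions tighten the lower bound toward the integral.
```

Each bound stays below the true integral and tightens as the partition refines, mirroring how the TVO interpolates between the ELBO and the marginal log-likelihood.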


Reviews: Dual Path Networks

Neural Information Processing Systems

The authors propose a new network architecture that combines ResNets and DenseNets. They introduce a very informative theoretical formulation that can express ResNets, DenseNets, and their proposed architecture.
Pros:
(+) The paper is well written, with both theoretical and empirical results.
(+) The authors provide useful analysis and statistics.
(+) The impact of DPNs is shown on a variety of computer vision tasks.
(+) The performance of DPNs on the presented vision tasks is compelling.
Cons:
(-) Optional results on MS COCO would make the paper even stronger.
Network engineering is an important field, and it is important that it is done correctly, with analysis and many in-depth experiments. The impact of new architectures comes through their generalization capabilities. This paper does a good job on all of the above.
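As a rough illustration of the ResNet/DenseNet combination, the sketch below keeps two states per block: a residual path updated by addition and a dense path grown by concatenation. The split of the block output into an update and an increment, and all shapes, are illustrative assumptions rather than the paper's exact layer design.

```python
import numpy as np

def dual_path_block(res_state, dense_state, f):
    # f maps the concatenated state to new features; its output is split
    # into a residual update and a dense increment.
    out = f(np.concatenate([res_state, dense_state], axis=-1))
    width = res_state.shape[-1]
    res_update, dense_increment = out[..., :width], out[..., width:]
    new_res = res_state + res_update                                      # ResNet-style addition
    new_dense = np.concatenate([dense_state, dense_increment], axis=-1)   # DenseNet-style concat
    return new_res, new_dense

rng = np.random.default_rng(0)
h_res = rng.standard_normal((2, 8))    # residual path: fixed width
h_dense = rng.standard_normal((2, 4))  # dense path: grows each block
f = lambda h: np.tanh(h @ rng.standard_normal((h.shape[-1], 12)))
h_res, h_dense = dual_path_block(h_res, h_dense, f)
```

After one block the residual path keeps its width while the dense path widens, which is the feature-reuse/feature-exploration trade-off the dual-path idea exploits.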


Reviews: Online Learning for Multivariate Hawkes Processes

Neural Information Processing Systems

This paper describes an algorithm for optimization of Hawkes process parameters in online settings, where a non-parametric form of the kernel is learned. The paper presents a gradient approach to optimization, with theoretical analysis thereof. In particular, the authors provide: a regret bound; justification for the simplification steps (discretization of time, and truncation of the time over which previous posts influence a new post); an approach to a tractable projection of the solution (a step in the algorithm); and a time-complexity analysis. The paper is very well written, which is very helpful given how mathematically involved it is. I found it to tackle an important problem (online learning is important for large-scale datasets, and non-parametricity is a very reasonable setting when it is hard to specify a reasonable kernel form a priori).
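The two simplifications the review mentions, time discretization and truncation of past influence, can be sketched in a multivariate Hawkes intensity where each kernel φ_ij is stored as per-bin values. The shapes, bin width, and window length below are illustrative assumptions, not the paper's choices.

```python
import numpy as np

def intensity(t, events, mu, kernel_vals, dt, window):
    # events: list of (time, dim) pairs; mu: base rates, shape (D,).
    # kernel_vals[i, j, b]: discretized kernel phi_ij on bins of width dt,
    # truncated so events older than `window` have no influence.
    lam = mu.copy()
    for s, j in events:
        lag = t - s
        if 0 < lag <= window:
            b = min(int(lag / dt), kernel_vals.shape[2] - 1)
            lam += kernel_vals[:, j, b]
    return lam

mu = np.array([0.1, 0.2])
kernel_vals = np.full((2, 2, 2), 0.05)  # toy piecewise-constant kernels
events = [(0.0, 0), (0.3, 1)]
lam = intensity(0.6, events, mu, kernel_vals, dt=0.5, window=1.0)
```

Discretization makes the kernel a finite parameter vector (so it can be learned non-parametrically), and truncation bounds the per-event work, which is what makes the online updates tractable.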


Reviews: On the Convergence and Robustness of Training GANs with Regularized Optimal Transport

Neural Information Processing Systems

SUMMARY: The authors investigate the task of training a Generative Adversarial Network (GAN) based on an optimal transport (OT) loss. They focus on regularized OT losses, and show that approximate gradients of these losses can be obtained by approximately solving the regularized OT problem (Thm 4.1). As a consequence, a non-convex stochastic gradient method for minimizing this loss has a provable convergence rate to stationarity (Thm 4.2). The analysis also applies to Sinkhorn losses. The authors then numerically explore the behavior of a practical algorithm in which the dual variables are parametrized by neural networks (the theory does not immediately apply, because estimating the loss gradient becomes a non-convex problem).
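The inner step of "approximately solving the regularized OT problem" is commonly done with Sinkhorn iterations when the regularizer is entropic; the sketch below is a generic such solver for two discrete histograms, not the paper's specific algorithm (which parametrizes the dual variables with networks).

```python
import numpy as np

def sinkhorn(C, a, b, eps, iters=200):
    # Entropy-regularized OT between histograms a and b with cost matrix C:
    # alternately scale the Gibbs kernel so the plan's marginals match a and b.
    K = np.exp(-C / eps)
    u = np.ones_like(a)
    for _ in range(iters):
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]   # approximate transport plan

C = np.array([[0.0, 1.0],
              [1.0, 0.0]])              # toy cost matrix
a = np.array([0.5, 0.5])
b = np.array([0.5, 0.5])
P = sinkhorn(C, a, b, eps=0.1)
```

The marginals of `P` match `a` and `b` up to the approximation error, and the gradient of the regularized loss with respect to the inputs can then be read off from the (approximate) dual variables, which is the mechanism behind Thm 4.1.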