AITopics | concatenation

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation (Supplementary Materials)

Neural Information Processing SystemsApr-25-2026, 21:40:27 GMT

Recall that for the n-way multiple choice setting, n 1 choices are negative pairs and only one pair is positive. Accordingly, for n = 4, 3 distractors are sampled, each with an incorrect pose embedding, while the 4th choice contains the matching pose embedding for the given vision and audio embeddings. In other words, the fusion embedding consisting of the vision and audio embeddings is kept as the anchor while negatives are sampled from the pose embeddings only. Of the 3 negative pose embeddings, 2 are considered "easy" negatives, sampled randomly from the entire training set, while the last one is a "hard" negative, sampled randomly from a pool of 25 embeddings corresponding to the 25 nearest neighbours of the anchor vision embedding. In the n = 3case, 2 hard negatives and no easy negatives are used, with the same nearest neighbour sampling method based on the anchorshared weights embedding.

artificial intelligence, machine learning, modality, (12 more...)

Neural Information Processing Systems

Country: North America > Canada (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.68)
Information Technology > Artificial Intelligence > Vision (0.49)

Add feedback

ADerivation of time-evolving attention operators

Neural Information Processing SystemsApr-25-2026, 06:51:44 GMT

We show the full derivation of Equation 6 as follows. Recall that X0i is the concatenation of Xi and Tl. The model variation used here in TransEvolve-fullFF. Thus, on the limiting case, we get E[Ul(Ul)>] = 1I where I is the d-dimensional identity matrix. This way, Ul2 dd approximates a rotation matrix as we choose σ = O(d).

artificial intelligence, exp, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction

Shuyang Sun, Jiangmiao Pang, Jianping Shi, Shuai Yi, Wanli Ouyang

Neural Information Processing SystemsMar-15-2026, 17:25:14 GMT

The basic principles in designing convolutional neural network (CNN) structures for predicting objects on different levels, e.g., image-level, region-level, and pixellevel, are diverging. Generally, network structures designed specifically for image classification are directly used as default backbone structure for other tasks including detection and segmentation, but there is seldom backbone structure designed under the consideration of unifying the advantages of networks designed for pixellevel or region-level predicting tasks, which may require very deep features with high resolution. Towards this goal, we design a fish-like network, called FishNet. In FishNet, the information of all resolutions is preserved and refined for the final task. Besides, we observe that existing works still cannot directly propagate the gradient information from deep layers to shallow layers. Our design can better handle this problem. Extensive experiments have been conducted to demonstrate the remarkable performance of the FishNet. In particular, on ImageNet-1k, the accuracy of FishNet is able to surpass the performance of DenseNet and ResNet with fewer parameters. FishNet was applied as one of the modules in the winning entry of the COCO Detection 2018 challenge.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

8fb21ee7a2207526da55a679f0332de2-Paper.pdf

Neural Information Processing SystemsFeb-19-2026, 06:21:48 GMT

activation function, architecture, residual flow, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

cf70320e93c08b39b1b29a348097a376-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 05:15:55 GMT

artificial intelligence, machine learning, policy network, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

Add feedback

Neural Edit Operations for Biological Sequences

Satoshi Koide, Keisuke Kawano, Takuro Kutsuna

Neural Information Processing SystemsFeb-14-2026, 16:02:21 GMT

Neural Information Processing Systems http://nips.cc/

accuracy, architecture, regular expression, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)

Add feedback

4ea06fbc83cdd0a06020c35d50e1e89a-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 03:13:18 GMT

ac-gan, cgan, experiment, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

c017e92288b5056c578bb6b0b69d9e76-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 17:07:32 GMT

deep forest, new feature, random forest, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(2 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

cd10c7f376188a4a2ca3e8fea2c03aeb-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 10:29:06 GMT

arma layer, convolution, stability, (15 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Invertible DenseNets with Concatenated LipSwish

Neural Information Processing SystemsFeb-9-2026, 21:29:46 GMT

We introduce Invertible Dense Networks (i-DenseNets), a more parameter efficient extension of Residual Flows. The method relies on an analysis of the Lipschitz continuity of the concatenation in DenseNets, where we enforce invertibility of the network by satisfying the Lipschitz constant. Furthermore, we propose a learnable weighted concatenation, which not only improves the model performance but also indicates the importance of the concatenated weighted representation. Additionally, we introduce the Concatenated LipSwish as activation function, for which we show how to enforce the Lipschitz condition and which boosts performance. The new architecture, i-DenseNet, out-performs Residual Flow and other flow-based models on density estimation evaluated in bits per dimension, where we utilize an equal parameter budget. Moreover, we show that the proposed model out-performs Residual Flows when trained as a hybrid model where the model is both a generative and a discriminative model.

artificial intelligence, concatenated lipswish, machine learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

Filters

Collaborating Authors

concatenation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation (Supplementary Materials)

ADerivation of time-evolving attention operators

FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction

8fb21ee7a2207526da55a679f0332de2-Paper.pdf

cf70320e93c08b39b1b29a348097a376-Supplemental-Conference.pdf

Neural Edit Operations for Biological Sequences

4ea06fbc83cdd0a06020c35d50e1e89a-AuthorFeedback.pdf

c017e92288b5056c578bb6b0b69d9e76-Supplemental-Conference.pdf

cd10c7f376188a4a2ca3e8fea2c03aeb-Supplemental.pdf

Invertible DenseNets with Concatenated LipSwish