Sequential Memory with Temporal Predictive Coding Supplementary Materials
In Algorithm 1 we present the memorizing and recalling procedures of the single-layer tPC.

Algorithm 1: Memorizing and recalling with single-layer tPC

Here we present the proof of Property 1 in the main text: the single-layer tPC can be viewed as a "whitened" version of the AHN. When applied to the data sequence, it whitens the data (Eq. 16 in the main text). These observations are consistent with our numerical results shown in Figure 1: MCAHN has a much larger MSE than the tPC because of its entirely wrong recalls. Figure 1 also presents the online recall results of the models on MovingMNIST, CIFAR10 and UCF101. In Figure 4 we show a natural example of an aliased sequence, in which a movie of a human doing push-ups is memorized and recalled by the model.
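The memorize-and-recall procedure of Algorithm 1 can be illustrated with a minimal sketch. This assumes a linear single-layer tPC whose weights are trained by gradient descent on the temporal prediction error; the function names, learning rate, and epoch count are illustrative choices, not the paper's settings.

```python
import numpy as np

def tpc_memorize(seq, lr=0.05, epochs=300):
    """Learn a weight matrix W that predicts x_t from x_{t-1}
    by gradient descent on the squared temporal prediction error."""
    d = seq.shape[1]
    W = np.zeros((d, d))
    for _ in range(epochs):
        for t in range(1, len(seq)):
            err = seq[t] - W @ seq[t - 1]        # temporal prediction error
            W += lr * np.outer(err, seq[t - 1])  # error-driven weight update
    return W

def tpc_recall(W, cue):
    """Recall the next item in the sequence from a cue of the previous one."""
    return W @ cue

# memorize a short random sequence, then recall item t+1 from item t
rng = np.random.default_rng(0)
seq = rng.standard_normal((5, 8))  # 5 patterns, 8 dimensions each
W = tpc_memorize(seq)
recalled = tpc_recall(W, seq[2])   # should approximate seq[3]
```

With short random sequences the linear predictor can interpolate the pairs exactly, so recall error shrinks toward zero as training proceeds.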
Discussion and implementation details.

[All Reviewers] Related work. We agree with the reviewers that a more extended discussion of related work is required. We note that a recent arXiv paper (MemDPC, to appear in ECCV 2020, by Han et al.) has also used both RGB and optical flow; that model is trained on UCF101 with the same schedule as CoCLR for a fair comparison. We will add these discussions. We actually used the same augmentation as DPC in their released codebase. These are the core contributions of our paper. The CoCLR-RGB model gets 70.2% by linear probing.
Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering
Kurpukdee, Nattapong, Bors, Adrian G.
We propose a realistic scenario for unsupervised video learning in which neither task boundaries nor labels are provided when learning a succession of tasks, and we provide a non-parametric solution to the under-explored problem of unsupervised video continual learning. Videos are a complex and rich spatio-temporal medium, widely used in many applications, but they have not been sufficiently explored in unsupervised continual learning. Prior studies have focused on supervised continual learning, relying on knowledge of labels and task boundaries, yet labeled data is costly and often impractical to obtain. To address this gap, we study unsupervised video continual learning (uVCL), which raises additional challenges due to the extra computational and memory requirements of processing videos compared to images. We introduce a general benchmark experimental protocol for uVCL by considering the learning of unstructured video data categories during each task. We propose to use Kernel Density Estimation (KDE) over deep embedded video features, extracted by unsupervised video transformer networks, as a non-parametric probabilistic representation of the data. We introduce a novelty detection criterion for incoming new-task data that dynamically enables the expansion of memory clusters, aiming to capture new knowledge when learning a succession of tasks. We leverage transfer learning from previous tasks as the initial state for knowledge transfer to the current learning task. We find that the proposed methodology substantially enhances the performance of the model when successively learning many tasks. We perform in-depth evaluations on three standard video action recognition datasets, UCF101, HMDB51, and Something-Something V2, without using any labels or class boundaries.
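The KDE-based novelty criterion described in the abstract can be sketched as follows: fit a Gaussian KDE on the features of the existing memory clusters and flag an incoming sample as novel when its log-density falls below a threshold, triggering cluster expansion. The bandwidth and threshold values here are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def gaussian_kde_logdensity(x, centers, bandwidth):
    """Log-density of a Gaussian KDE fitted on `centers`, evaluated at x."""
    d = centers.shape[1]
    sq = np.sum((centers - x) ** 2, axis=1)
    log_k = -sq / (2 * bandwidth**2) - 0.5 * d * np.log(2 * np.pi * bandwidth**2)
    # log-mean-exp over kernels, computed stably
    return np.logaddexp.reduce(log_k) - np.log(len(centers))

def is_novel(x, centers, bandwidth=1.0, threshold=-20.0):
    """Flag x as novel (i.e., start a new memory cluster) when its density
    under the current clusters falls below the threshold."""
    return gaussian_kde_logdensity(x, centers, bandwidth) < threshold

# toy usage: features near the existing cluster are familiar, far ones novel
rng = np.random.default_rng(0)
centers = rng.normal(0.0, 0.5, size=(20, 4))  # stored cluster features
familiar = is_novel(centers[0], centers)       # False
novel = is_novel(np.full(4, 10.0), centers)    # True
```

In a continual-learning loop, a True result would trigger allocating a new cluster and adding the sample's feature to the memory, rather than assigning it to an existing cluster.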
Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation
Buzovkin, Alexey, Shilov, Evgeny
We investigate methods to reduce inference time and memory footprint in Stable Diffusion models by introducing lightweight decoders for both image and video synthesis. Traditional latent diffusion pipelines rely on large Variational Autoencoder (VAE) decoders that slow down generation and consume considerable GPU memory. We propose custom-trained decoders based on lightweight Vision Transformer and Taming Transformers architectures. Experiments show up to 15% overall speed-ups for image generation on COCO2017 and up to 20x faster decoding in the decoder sub-module, with additional gains on UCF-101 for video tasks. Memory requirements are moderately reduced, and while there is a small drop in perceptual quality compared to the default decoder, the improvements in speed and scalability are crucial for large-scale inference scenarios such as generating 100K images. Our work is further contextualized by advances in efficient video generation, including dual masking strategies, illustrating a broader effort to improve the scalability and efficiency of generative models.
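To illustrate why a lightweight decoder is cheap, the ViT-style idea can be reduced to its essence: map each latent token to one image patch with a single linear projection, then stitch the patches into the image. This toy numpy sketch (the shapes, names, and random weights are our own, not the paper's architecture) shows the shape bookkeeping; a real decoder would use a trained projection and a few transformer blocks.

```python
import numpy as np

def lightweight_decode(latent, w_patch, patch=8):
    """Decode an (h, w, c) latent grid into an (h*patch, w*patch, 3) RGB image
    by mapping every latent vector to one patch with a single linear layer."""
    h, w, c = latent.shape
    patches = latent.reshape(h * w, c) @ w_patch        # (h*w, patch*patch*3)
    patches = patches.reshape(h, w, patch, patch, 3)
    # interleave the patch axes with the grid axes to stitch a full image
    return patches.transpose(0, 2, 1, 3, 4).reshape(h * patch, w * patch, 3)

rng = np.random.default_rng(0)
latent = rng.standard_normal((4, 4, 16))         # 4x4 grid of 16-dim latents
w_patch = rng.standard_normal((16, 8 * 8 * 3))   # stand-in for a trained projection
img = lightweight_decode(latent, w_patch)        # shape (32, 32, 3)
```

The decoder here is one matrix multiply per image, which is the source of the speed-up; the quality gap relative to a deep convolutional VAE decoder is what the custom training in the paper aims to close.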