AITopics | re-examination

Re-examination of the Role of Latent Variables in Sequence Modeling

Neural Information Processing SystemsDec-26-2025, 00:16:38 GMT

With latent variables, stochastic recurrent models have achieved state-of-the-art performance in modeling sound-wave sequence. However, opposite results are also observed in other domains, where standard recurrent networks often outperform stochastic models. To better understand this discrepancy, we re-examine the roles of latent variables in stochastic recurrent models for speech density estimation. Our analysis reveals that under the restriction of fully factorized output distribution in previous evaluations, the stochastic variants were implicitly leveraging intra-step correlation but the deterministic recurrent baselines were prohibited to do so, resulting in an unfair comparison. To correct the unfairness, we remove such restriction in our re-examination, where all the models can explicitly leverage intra-step correlation with an auto-regressive structure. Over a diverse set of univariate and multivariate sequential data, including human speech, MIDI music, handwriting trajectory, and frame-permuted speech, our results show that stochastic recurrent models fail to deliver the performance advantage claimed in previous work.

latent variable, name change, re-examination, (6 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.60)

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

Toward More Reliable Artificial Intelligence: Reducing Hallucinations in Vision-Language Models

Sanogo, Kassoum, Ardiccioni, Renzo

arXiv.org Artificial IntelligenceDec-11-2025

Vision-language models (VLMs) frequently generate hallucinated content plausible but incorrect claims about image content. We propose a training-free self-correction framework enabling VLMs to iteratively refine responses through uncertainty-guided visual re-attention. Our method combines multidimensional uncertainty quantification (token entropy, attention dispersion, semantic consistency, claim confidence) with attention-guided cropping of under-explored regions. Operating entirely with frozen, pretrained VLMs, our framework requires no gradient updates. We validate our approach on the POPE and MMHAL BENCH benchmarks using the Qwen2.5-VL-7B [23] architecture. Experimental results demonstrate that our method reduces hallucination rates by 9.8 percentage points compared to the baseline, while improving object existence accuracy by 4.7 points on adversarial splits. Furthermore, qualitative analysis confirms that uncertainty-guided re-attention successfully grounds corrections in visual evidence where standard decoding fails. We validate our approach on Qwen2.5-VL-7B [23], with plans to extend validation across diverse architectures in future versions. We release our code and methodology to facilitate future research in trustworthy multimodal systems.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2512.07564

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Reviews: Re-examination of the Role of Latent Variables in Sequence Modeling

Neural Information Processing SystemsJan-27-2025, 05:33:01 GMT

The authors discuss the role of latent variable models in sequence models where multiple observations of the time series are modeled at once using a factorized form which assumes conditional independence. This assumption is almost surely violated in practice, thus limiting the performance of such models. When the sequence model is provided with latent variables it is possible to account for the correlation structure of the likely correlated observations within a time window, thus resulting in better performance compared to models without latent variables. Results on multiple datasets demonstrate this intuition. Though the analysis presented by the authors is clear, well motivated and justified, the paper seems to downplay the importance and motivation of sequence models that consider multiple observations at once in a windowed manner, and how sequence models with stochastic (latent) variables by their ability to capture correlation structure alleviate some of the issues associated with windowing, i.e., the conditional independence assumption.

latent variable, sequence model, sequence modeling, (3 more...)

Neural Information Processing Systems

Genre: Summary/Review (0.63)

Technology: Information Technology > Artificial Intelligence (0.88)

Add feedback

Re-examination of the Role of Latent Variables in Sequence Modeling

Neural Information Processing SystemsOct-10-2024, 23:42:04 GMT

With latent variables, stochastic recurrent models have achieved state-of-the-art performance in modeling sound-wave sequence. However, opposite results are also observed in other domains, where standard recurrent networks often outperform stochastic models. To better understand this discrepancy, we re-examine the roles of latent variables in stochastic recurrent models for speech density estimation. Our analysis reveals that under the restriction of fully factorized output distribution in previous evaluations, the stochastic variants were implicitly leveraging intra-step correlation but the deterministic recurrent baselines were prohibited to do so, resulting in an unfair comparison. To correct the unfairness, we remove such restriction in our re-examination, where all the models can explicitly leverage intra-step correlation with an auto-regressive structure.

latent variable, recurrent model, sequence modeling, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.43)

Add feedback

Re-examination of the Role of Latent Variables in Sequence Modeling

Lai, Guokun, Dai, Zihang, Yang, Yiming, Yoo, Shinjae

Neural Information Processing SystemsMar-18-2020, 23:46:41 GMT

With latent variables, stochastic recurrent models have achieved state-of-the-art performance in modeling sound-wave sequence. However, opposite results are also observed in other domains, where standard recurrent networks often outperform stochastic models. To better understand this discrepancy, we re-examine the roles of latent variables in stochastic recurrent models for speech density estimation. Our analysis reveals that under the restriction of fully factorized output distribution in previous evaluations, the stochastic variants were implicitly leveraging intra-step correlation but the deterministic recurrent baselines were prohibited to do so, resulting in an unfair comparison. To correct the unfairness, we remove such restriction in our re-examination, where all the models can explicitly leverage intra-step correlation with an auto-regressive structure.

latent variable, recurrent model, sequence modeling, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.41)

Technology: Information Technology > Artificial Intelligence (0.47)

Add feedback

Filters

Collaborating Authors

re-examination

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Re-examination of the Role of Latent Variables in Sequence Modeling

Toward More Reliable Artificial Intelligence: Reducing Hallucinations in Vision-Language Models

Reviews: Re-examination of the Role of Latent Variables in Sequence Modeling

Re-examination of the Role of Latent Variables in Sequence Modeling

Re-examination of the Role of Latent Variables in Sequence Modeling