AITopics | sample generation

Collaborating Authors

sample generation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

6a27ee6f66d13557f15f070274c51721-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 08:40:01 GMT

artificial intelligence, machine learning, score function, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Anhui Province > Hefei (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
(2 more...)

Genre: Research Report (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Self-Diagnosing GAN: Diagnosing Underrepresented Samples in Generative Adversarial Networks

Neural Information Processing SystemsDec-23-2025, 18:46:28 GMT

Despite remarkable performance in producing realistic samples, Generative Adversarial Networks (GANs) often produce low-quality samples near low-density regions of the data manifold, e.g., samples of minor groups. Many techniques have been developed to improve the quality of generated samples, either by post-processing generated samples or by pre-processing the empirical data distribution, but at the cost of reduced diversity. To promote diversity in sample generation without degrading the overall quality, we propose a simple yet effective method to diagnose and emphasize underrepresented samples during training of a GAN. The main idea is to use the statistics of the discrepancy between the data distribution and the model distribution at each data instance. Based on the observation that the underrepresented samples have a high average discrepancy or high variability in discrepancy, we propose a method to emphasize those samples during training of a GAN. Our experimental results demonstrate that the proposed method improves GAN performance on various datasets, and it is especially effective in improving the quality and diversity of sample generation for minor groups.

diagnosing underrepresented sample, generative adversarial network, self-diagnosing gan, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

Add feedback

Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings

Thakare, Riddhish, Akugri, Kingdom Mutala

arXiv.org Artificial IntelligenceOct-16-2025

High-dimensional data analysis and visualization constitute fundamental challenges in machine learning, where nonlinear dimensionality reduction (NLDR) techniques have proven instrumental in discovering low-dimensional embeddings that preserve essential structural properties of complex datasets. These methods, encompassing techniques such as t-distributed Stochastic Neighbor Embedding (t-SNE) [13], Isometric Mapping (Isomap) [12], Locally Linear Embedding (LLE) [10] and Laplacian Eigenmaps [1] excel at revealing intrinsic data manifolds and facilitating interpretable visualizations of high-dimensional phenomena. However, a critical architectural limitation pervades the entire class of traditional NLDR methods: they inherently lack reconstruction capabilities, operating as one-way transformations that map from high-dimensional input spaces to low-dimensional embeddings without providing mechanisms for inverse mapping. This fundamental asymmetry severely constrains the applicability of NLDR techniques in generative modelling, data synthesis, and interactive exploration scenarios where bidirectional transformations are essential. Unlike autoen-coders, which explicitly incorporate decoder architectures during training, classical manifold learning approaches such as t-SNE, Uniform Manifold Approximation and Projection (UMAP) [8], and diffusion maps optimize embeddings through eigen decomposition, neighbourhood preservation, or probabilistic formulations that do not naturally yield invertible mappings. Consequently, despite their superior performance in preserving local neighbourhood structures and global topological properties, these methods remain confined to analysis and visualization tasks. This work addresses the reconstruction gap in NLDR methods by developing specialized decoder architectures that enable bidirectional mapping between high-dimensional data and learned manifold representations.

artificial intelligence, machine learning, manifold, (16 more...)

arXiv.org Artificial Intelligence

2510.13622

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.91)

Add feedback

6a27ee6f66d13557f15f070274c51721-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 20:29:28 GMT

artificial intelligence, machine learning, score function, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Anhui Province > Hefei (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
(2 more...)

Genre: Research Report (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification

Cha, Suman, Kim, Hyunjoong

arXiv.org Machine LearningSep-16-2025

Traditional oversampling techniques--including SMOTE and its variants--generate synthetic minority samples via local interpolation but fail to capture global data distributions in high-dimensional spaces. Deep generative models based on GANs offer richer distribution modeling yet suffer from training instability and mode collapse under severe imbalance. To overcome these limitations, we introduce an oversampling framework that learns a parametric transformation to map majority samples into the minority distribution. Our approach minimizes the maximum mean discrepancy (MMD) between transformed and true minority samples for global alignment, and incorporates a triplet loss regularizer to enforce boundary awareness by guiding synthesized samples toward challenging borderline regions. We evaluate our method on 29 synthetic and real-world datasets, demonstrating consistent improvements over classical and generative baselines in AUROC, G-mean, F1-score, and MCC. These results confirm the robustness, computational efficiency, and practical utility of the proposed framework for imbalanced classification tasks.

conference, dataset, international conference, (15 more...)

arXiv.org Machine Learning

2509.11511

Genre:

Research Report > New Finding (0.87)
Research Report > Experimental Study (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A novel forecasting framework combining virtual samples and enhanced Transformer models for tourism demand forecasting

Diao, Tingting, Wu, Xinzhang, Yang, Lina, Xiao, Ling, Dong, Yunxuan

arXiv.org Artificial IntelligenceMar-25-2025

Accurate tourism demand forecasting is hindered by limited historical data and complex spatiotemporal dependencies among tourist origins. A novel forecasting framework integrating virtual sample generation and a novel Transformer predictor addresses constraints arising from restricted data availability. A spatiotemporal GAN produces realistic virtual samples by dynamically modeling spatial correlations through a graph convolutional network, and an enhanced Transformer captures local patterns with causal convolutions and long-term dependencies with self-attention,eliminating autoregressive decoding. A joint training strategy refines virtual sample generation based on predictor feedback to maintain robust performance under data-scarce conditions. Experimental evaluations on real-world daily and monthly tourism demand datasets indicate a reduction in average MASE by 18.37% compared to conventional Transformer-based models, demonstrating improved forecasting accuracy. The integration of adaptive spatiotemporal sample augmentation with a specialized Transformer can effectively address limited-data forecasting scenarios in tourism management.

data mining, machine learning, predictor, (19 more...)

arXiv.org Artificial Intelligence

2503.19423

Country:

Europe > United Kingdom (0.14)
North America > United States (0.14)
Asia > Macao (0.06)
(23 more...)

Genre: Research Report > New Finding (0.46)

Industry: Consumer Products & Services > Travel (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generative Modeling of Microweather Wind Velocities for Urban Air Mobility

Shah, Tristan A., Stanley, Michael C., Warner, James E.

arXiv.org Artificial IntelligenceMar-4-2025

Motivated by the pursuit of safe, reliable, and weather-tolerant urban air mobility (UAM) solutions, this work proposes a generative modeling approach for characterizing microweather wind velocities. Microweather, or the weather conditions in highly localized areas, is particularly complex in urban environments owing to the chaotic and turbulent nature of wind flows. Furthermore, traditional means of assessing local wind fields are not generally viable solutions for UAM applications: 1) field measurements that would rely on permanent wind profiling systems in operational air space are not practical, 2) physics-based models that simulate fluid dynamics at a sufficiently high resolution are not computationally tractable, and 3) data-driven modeling approaches that are largely deterministic ignore the inherent variability in turbulent flows that dictates UAM reliability. Thus, advancements in predictive capabilities are needed to help mitigate the unique operational safety risks that microweather winds pose for smaller, lighter weight UAM aircraft. This work aims to model microweather wind velocities in a manner that is computationally-efficient, captures random variability, and would only require a temporary, rather than permanent, field measurement campaign. Inspired by recent breakthroughs in conditional generative AI such as text-to-image generation, the proposed approach learns a probabilistic macro-to-microweather mapping between regional weather forecasts and measured local wind velocities using generative modeling (denoising diffusion probabilistic models, flow matching, and Gaussian mixture models). A simple proof of concept was implemented using a dataset comprised of local (micro) measurements from a Sonic Detection and Ranging (SoDAR) wind profiler along with (macro) forecast data from a nearby weather station over the same time period.

generative model, gmm, macroweather condition, (17 more...)

arXiv.org Artificial Intelligence

2503.0269

Country:

North America > United States > Virginia > Hampton (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Singapore (0.04)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > North America Government > United States Government (0.94)
Transportation (0.93)
Energy > Renewable > Wind (0.88)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models

Chae, Daewon, Choi, June Suk, Kim, Jinkyu, Lee, Kimin

arXiv.org Artificial IntelligenceFeb-19-2025

Fine-tuning text-to-image diffusion models to maximize rewards has proven effective for enhancing model performance. However, reward fine-tuning methods often suffer from slow convergence due to online sample generation. Therefore, obtaining diverse samples with strong reward signals is crucial for improving sample efficiency and overall performance. In this work, we introduce DiffExp, a simple yet effective exploration strategy for reward fine-tuning of text-to-image models. Our approach employs two key strategies: (a) dynamically adjusting the scale of classifier-free guidance to enhance sample diversity, and (b) randomly weighting phrases of the text prompt to exploit high-quality reward signals. We demonstrate that these strategies significantly enhance exploration during online sample generation, improving the sample efficiency of recent reward fine-tuning methods, such as DDPO and AlignProp.

artificial intelligence, fine-tuning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2502.1407

Country: Asia > Middle East > Republic of Türkiye (0.14)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas > Upstream (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)

Add feedback

Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework

Lin, Longzhong, Lin, Xuewu, Xu, Kechun, Lu, Haojian, Huang, Lichao, Xiong, Rong, Wang, Yue

arXiv.org Artificial IntelligenceJan-28-2025

Simulation plays a crucial role in assessing autonomous driving systems, where the generation of realistic multi-agent behaviors is a key aspect. In multi-agent simulation, the primary challenges include behavioral multimodality and closed-loop distributional shifts. In this study, we revisit mixture models for generating multimodal agent behaviors, which can cover the mainstream methods including continuous mixture models and GPT-like discrete models. Furthermore, we introduce a closed-loop sample generation approach tailored for mixture models to mitigate distributional shifts. Within the unified mixture model~(UniMM) framework, we recognize critical configurations from both model and data perspectives. We conduct a systematic examination of various model configurations, including positive component matching, continuous regression, prediction horizon, and the number of components. Moreover, our investigation into the data configuration highlights the pivotal role of closed-loop samples in achieving realistic simulations. To extend the benefits of closed-loop samples across a broader range of mixture models, we further address the shortcut learning and off-policy learning issues. Leveraging insights from our exploration, the distinct variants proposed within the UniMM framework, including discrete, anchor-free, and anchor-based models, all achieve state-of-the-art performance on the WOSAC benchmark.

artificial intelligence, mixture model, simulation, (14 more...)

arXiv.org Artificial Intelligence

2501.17015

Country:

Asia > China > Zhejiang Province > Hangzhou (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Education (0.68)
Information Technology (0.48)
Transportation > Ground > Road (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Enhancing Sample Generation of Diffusion Models using Noise Level Correction

Abuduweili, Abulikemu, Yuan, Chenyang, Liu, Changliu, Permenter, Frank

arXiv.org Artificial IntelligenceJan-9-2025

The denoising process of diffusion models can be interpreted as an approximate projection of noisy samples onto the data manifold. Moreover, the noise level in these samples approximates their distance to the underlying manifold. Building on this insight, we propose a novel method to enhance sample generation by aligning the estimated noise level with the true distance of noisy samples to the manifold. Specifically, we introduce a noise level correction network, leveraging a pre-trained denoising network, to refine noise level estimates during the denoising process. Additionally, we extend this approach to various image restoration tasks by integrating task-specific constraints, including inpainting, deblurring, super-resolution, colorization, and compressed sensing. Experimental results demonstrate that our method significantly improves sample quality in both unconstrained and constrained generation scenarios. Notably, the proposed noise level correction framework is compatible with existing denoising schedulers (e.g., DDIM), offering additional performance improvements.

artificial intelligence, machine learning, noise level correction, (13 more...)

arXiv.org Artificial Intelligence

2412.05488

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback