AITopics | patch size

Collaborating Authors

patch size

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

e4667dd0a5a54b74019b72b677ed8ec1-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 02:39:48 GMT

W Dif nificantly e fusion propose reduce models Patch the are Dif training po fusion werful,, time a generic but costs they while patch-wise require impro a lot training ving of time data frame ef and ficienc w data ork, y to, to which train.

artificial intelligence, diffusion model, machine learning, (14 more...)

Neural Information Processing Systems

Industry: Media (0.93)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Masked Generative Adversarial Networks are Data-Efficient Generation Learners Supplemental Materials

Neural Information Processing SystemsApr-24-2026, 14:15:19 GMT

Prior studies [18, 12] show that GAN often experiences generation failures with severely degraded generation performance when only limited training data is available. Specifically, with limited training data, the discriminator tends to discriminate via meaningless shortcuts by merely focusing on easy-to-discriminate image locations and spectra instead of holistic understanding of images. This can be viewed clearly in Figure 1, where the Gini Coefficient [4] of discriminator's spatial attentions increases quickly along the training iteration (when only limited training data is available). Note that the Gini coefficient [4] is negatively correlated with equality, i.e., the discriminator will pay more unevenly distributed attention to each spatial location while the Gini coefficient increases from '0' to '1'. For image generation with GAN, the large Gini coefficient (of discriminator's spatial attentions) thus means that the discriminator starts to focus on certain spatial locations (easy to discriminate) while ignoring other spatial locations (hard to discriminate), ultimately leading to an over-confident discriminator and training collapse. In another word, the Gini coefficient [4] of '0' expresses perfect equality where all values are the same (i.e., where the discriminator pays the same attention to every spatial location) while '1' expresses maximal inequality among values (i.e., the discriminator focuses on only one location while all others are ignored).

artificial intelligence, machine learning, maskedgan, (16 more...)

Neural Information Processing Systems

Country: Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

01c561df365429f33fcd7a7faa44c985-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 07:37:40 GMT

A.1 Datasets fMoWRGBFunctional Map of the World (fMoW) [17] is a dataset of high-resolution satellite image time series across the world, with a task of classification among 62 architecture categories such as airport, shipyard, and zoo. The license is provided here 2. Co-located images of different timestamps, or sequences, are provided in fMoW. They are of different length, and around 60% of the samples have length larger than 2. Readers can refer to the fMoW paper [17] for statistics on the distribution of sequence lengths. We construct a temporal version of fMoW by randomly associating every single image with two images of the same location but of different timestamps if possible. For a given spatial location loc, we define Tloc as the number of temporally distinct snapshots present in the dataset. We crop surface reflectance images from the Sentinel-2 (ESA) satellite (courtesy of the U.S. Geological Survey), consisting of 90-day composites of images at the same locations as fMoW images (to reduce the impacts of cloud coverage). At each fMoW datapoint location, we collect a time series of Sentinel-2 images, using the provided geo-coordinate bounding boxes.

artificial intelligence, image understanding, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.88)

Industry: Information Technology (0.95)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.95)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.48)

Add feedback

cb3213ada48302953cb0f166464ab356-Supplemental.pdf

Neural Information Processing SystemsFeb-19-2026, 09:12:10 GMT

classifier, dataset, patch size, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

d7aa002885ccbe68cf6880da583761b2-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 08:03:47 GMT

forecasting, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Asia > China > Beijing > Beijing (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.93)

Industry:

Health & Medicine (0.46)
Energy > Power Industry (0.46)
Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)

Add feedback

f8f78f8043f35890181a824e53a57134-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 01:21:22 GMT

artificial intelligence, patch size, raster scan, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.72)

Add feedback

Modeling Million-byte Sequences with Multiscale Transformers Lili Y u Dániel Simig Colin Flaherty Armen Aghajanyan Luke Zettlemoyer Mike Lewis Meta AI

Neural Information Processing SystemsFeb-18-2026, 01:21:18 GMT

Sequences of millions of bytes are ubiquitous; for example, music, image, or video files typically consist of multiple megabytes.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Neural Information Processing SystemsFeb-17-2026, 15:45:19 GMT

Diffusion models are powerful, but they require a lot of time and data to train. We propose Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training time costs while improving data efficiency, which thus helps democratize diffusion model training to broader users. At the core of our innovations is a new conditional score function at the patch level, where the patch location in the original image is included as additional coordinate channels, while the patch size is randomized and diversified throughout training to encode the cross-region dependency at multiple scales. Sampling with our method is as easy as in the original diffusion model.

artificial intelligence, diffusion model, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Industry: Media (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Appendix

Neural Information Processing SystemsFeb-16-2026, 09:05:14 GMT

For the rest 10k iterations, we further finetune the shared volume and the RGB branch.

artificial intelligence, evaluation, experiment, (15 more...)

Neural Information Processing Systems

Country: Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.05)

Technology: Information Technology > Artificial Intelligence > Vision (0.30)

Add feedback

Scaling transformer neural networks for skillful and reliable medium-range weather forecasting Tung Nguyen

Neural Information Processing SystemsFeb-16-2026, 03:16:07 GMT

Recently, data-driven approaches for weather forecasting based on deep learning have shown great promise, achieving accuracies that are competitive with operational systems. However, those methods often employ complex, customized architectures without sufficient ablation analysis, making it difficult to understand what truly contributes to their success.

artificial intelligence, machine learning, modeling & simulation, (20 more...)

Neural Information Processing Systems

Country: