AITopics | pixel value

We consider the training of the first layer of vision models and notice the clear relationship between pixel values and gradient update magnitudes: the gradients arriving at the weights of a first layer are by definition directly proportional to (normalized) input pixel values. Thus, an image with low contrast has a smaller impact on learning than an image with higher contrast, and a very bright or very dark image has a stronger impact on the weights than an image with moderate brightness. In this work, we propose performing gradient descent on the embeddings produced by the first layer of the model. However, switching to discrete inputs with an embedding layer is not a reasonable option for vision models. Thus, we propose the conceptual procedure of (i) a gradient descent step on first layer activations to construct an activation proposal, and (ii) finding the optimal weights of the first layer, i.e., those weights which minimize the squared distance to the activation proposal. We provide a closed form solution of the procedure and adjust it for robust stochastic training while computing everything efficiently. Empirically, we find that TrAct (Training Activations) speeds up training by factors between 1.25x and 4x while requiring only a small computational overhead. We demonstrate the utility of TrAct with different optimizers for a range of different vision models including convolutional and transformer architectures.
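Two claims in the abstract above can be checked numerically: the plain first-layer gradient scales with the input values, and an update fitted to an activation proposal does not. The sketch below assumes a dense first layer Z = X Wᵀ and solves the squared-distance problem as a damped least-squares fit. The function names, the damping term `lam`, and the random data are illustrative assumptions; this is not the TrAct implementation, which the abstract says is further adjusted for robust stochastic training.

```python
import numpy as np


def plain_first_layer_grad(X, G):
    """Gradient of the loss w.r.t. first-layer weights W for Z = X @ W.T.

    X: (batch, d_in) inputs; G: (batch, d_out) gradient of the loss w.r.t. Z.
    """
    return G.T @ X


def activation_proposal_update(X, G, lr, lam):
    """Weight change that best realises the activation proposal Z - lr * G.

    Solves  min_dW ||X @ dW.T + lr * G||^2 + lam * ||dW||^2  in closed form:
        dW = -lr * G.T @ X @ inv(X.T @ X + lam * I)
    `lam` is a hypothetical damping term, assumed here for numerical stability.
    """
    d_in = X.shape[1]
    A = X.T @ X + lam * np.eye(d_in)
    # A is symmetric, so solve(A, X.T @ G).T equals G.T @ X @ inv(A).
    return -lr * np.linalg.solve(A, X.T @ G).T


rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))  # 8 samples, 3 input features (e.g. pixel values)
G = rng.normal(size=(8, 2))  # upstream gradient for 2 first-layer units

# (1) The plain weight gradient scales linearly with the input magnitude.
grad_1x = plain_first_layer_grad(X, G)
grad_3x = plain_first_layer_grad(3.0 * X, G)

# (2) With negligible damping, the induced activation change X @ dW.T
#     is unaffected by rescaling the inputs.
dW_1x = activation_proposal_update(X, G, lr=0.1, lam=1e-12)
dW_3x = activation_proposal_update(3.0 * X, G, lr=0.1, lam=1e-12)
dZ_1x = X @ dW_1x.T
dZ_3x = (3.0 * X) @ dW_3x.T
```

In this toy setting the fitted update is the plain gradient `G.T @ X` right-multiplied by the inverse of `X.T @ X + lam * I`, which is what removes the dependence on input scale.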

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.82)

2439ec22091b9d6cfbebf3284b40116e-Paper-Conference.pdf

Neural Information Processing Systems | Feb-18-2026, 23:36:34 GMT

intersection, sketch, vector, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Vision (0.70)

d1422213c9f2bdd5178b77d166fba86a-Paper-Conference.pdf

Neural Information Processing Systems | Feb-17-2026, 06:27:58 GMT

artificial intelligence, machine learning, spp count, (14 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)

7fc36bce5de315751001981baaf4751a-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems | Feb-15-2026, 12:43:58 GMT

artificial intelligence, dataset, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

The continuous Bernoulli: fixing a pervasive error in variational autoencoders

Gabriel Loaiza-Ganem, John P. Cunningham

Neural Information Processing Systems | Feb-15-2026, 04:48:17 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, bernoulli vae, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Middle East > Jordan (0.04)
North America > United States > Colorado (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

6248a3b8279a39b3668a8a7c0e29164d-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 09:45:45 GMT

dataset, dood, model generalize, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

6e7d5d259be7bf56ed79029c4e621f44-Paper.pdf

Neural Information Processing Systems | Feb-9-2026, 07:06:14 GMT

implicit position, pixel value, screen content image, (10 more...)

Neural Information Processing Systems

Country: Asia > China > Tianjin Province > Tianjin (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Implicit Transformer Network for Screen Content Image Continuous Super-Resolution

Neural Information Processing Systems | Dec-24-2025, 06:47:25 GMT

Screen content is growing explosively with the wide use of screen sharing, remote cooperation, and online education. To match limited terminal bandwidth, high-resolution (HR) screen content may be downsampled and compressed. At the receiver side, super-resolution (SR) of low-resolution (LR) screen content images (SCIs) is in high demand, either for HR displays or for users who zoom in to inspect details. However, image SR methods, mostly designed for natural images, do not generalize well to SCIs because of the very different image characteristics and the requirement to browse SCIs at arbitrary scales. To this end, we propose a novel Implicit Transformer Super-Resolution Network (ITSRN) for SCISR. For high-quality continuous SR at arbitrary ratios, pixel values at query coordinates are inferred from image features at key coordinates by the proposed implicit transformer, and an implicit position encoding scheme is proposed to aggregate similar neighboring pixel values to the query one. We construct the benchmark SCI1K and SCI1K-compression datasets with LR and HR SCI pairs. Extensive experiments show that the proposed ITSRN significantly outperforms several competitive continuous and discrete SR methods for both compressed and uncompressed SCIs.
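The idea of inferring a pixel value at a continuous query coordinate from nearby key coordinates can be illustrated with a toy example. The sketch below uses fixed distance-based softmax weights as a stand-in for learned attention; it is not the ITSRN architecture, which the abstract says derives its weighting from image features and an implicit position encoding. The function name `query_pixel` and the `sharpness` parameter are hypothetical.

```python
import math


def query_pixel(image, y, x, sharpness=4.0):
    """Estimate a value at continuous coordinate (y, x), given in pixel units.

    The nearest grid pixels act as "keys". Each key gets a softmax weight
    based on its squared distance to the query, and the result is the
    weighted sum of the key values. Assumes the query lies inside the grid.
    """
    h, w = len(image), len(image[0])
    y0, x0 = math.floor(y), math.floor(x)
    keys = [(ky, kx)
            for ky in (y0, y0 + 1) for kx in (x0, x0 + 1)
            if 0 <= ky < h and 0 <= kx < w]
    scores = [-sharpness * ((y - ky) ** 2 + (x - kx) ** 2) for ky, kx in keys]
    top = max(scores)  # subtract the maximum for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return sum(wt * image[ky][kx] for wt, (ky, kx) in zip(weights, keys)) / total


lr_image = [[0.0, 1.0],
            [1.0, 0.0]]
center = query_pixel(lr_image, 0.5, 0.5)   # equidistant from all four pixels
near_00 = query_pixel(lr_image, 0.1, 0.1)  # dominated by pixel (0, 0)
```

Because the query coordinate is a real number rather than a grid index, the same function can be evaluated on a grid of any density, which is what makes arbitrary-ratio upsampling possible in this kind of formulation.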

implicit transformer network, name change, screen content image continuous super-resolution, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.63)

TrAct: Making First-layer Pre-Activations Trainable
