AITopics | Asia

Reconstructing the Image Stitching Pipeline: Integrating Fusion and Rectangling into a Unified Inpainting Model

Neural Information Processing SystemsApr-30-2026, 13:40:27 GMT

Deep learning-based image stitching pipelines are typically divided into three cascading stages: registration, fusion, and rectangling. Each stage requires its own network training and is tightly coupled to the others, leading to error propagation and posing significant challenges to parameter tuning and system stability. This paper proposes the Simple and Robust Stitcher (SRStitcher), which revolutionizes the image stitching pipeline by simplifying the fusion and rectangling stages into a unified inpainting model, requiring no model training or fine-tuning. We reformulate the problem definitions of the fusion and rectangling stages and demonstrate that they can be effectively integrated into an inpainting task. Furthermore, we design the weighted masks to guide the reverse process in a pre-trained largescale diffusion model, implementing this integrated inpainting task in a single inference. Through extensive experimentation, we verify the interpretability and generalization capabilities of this unified model, demonstrating that SRStitcher outperforms state-of-the-art methods in both performance and stability.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country: Asia (0.67)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Was Israeli PM's Lebanon destruction video a snub to Trump?

Al JazeeraApr-30-2026, 13:35:15 GMT

Why is Israel still in southern Lebanon? A war to shape Lebanon's future Was Israeli PM's Lebanon destruction video a snub to Trump? NewsFeed Was Israeli PM's Lebanon destruction video a snub to Trump? Hours after US President Donald Trump asked Benjamin Netanyahu to stop destroying buildings in Lebanon as it "makes Israel look bad", the Israeli prime minister published a montage of forces blowing up infrastructure across southern Lebanon.

artificial intelligence, live navigation menu news show, news section africa asia us, (6 more...)

Al Jazeera

Country: Asia > Middle East > Lebanon (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.58)

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Humanoid robots being trialled as airport workers in Japan

Al JazeeraApr-30-2026, 10:56:41 GMT

Japan Airlines says it will trial humanoid robots as workers at Tokyo's Haneda Airport, with tasks including baggage handling and cabin cleaning.

artificial intelligence, live navigation menu news show, video duration 00, (6 more...)

Al Jazeera

Country:

North America > United States (0.71)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.27)
Asia > Middle East > Iran (0.19)
Asia > Middle East > Israel (0.15)

Industry:

Transportation > Infrastructure & Services > Airport (0.77)
Transportation > Air (0.59)

Technology: Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.68)

Add feedback

Appendix AVariational Paragraph Embedder A.1 Selection of substitution rate p

Neural Information Processing SystemsApr-30-2026, 10:10:09 GMT

Figure 4: Impact of the proportion of injected noise for learning Paragraph Embeddings on XSum dataset. PPLint and the PPL of the generation obtained from training PLANNER on the corresponding z at different noise level. We observed when the value of p is within (0, 0.7), there Performing a grid search on each task using diffusion models is an expensive process. However, it has been observed that an increase in the value of p leads to a deviation between the two. This could be attributed to a higher conversion error that occurs when p is excessively large. A.2 Selection of number of latent code k The parameter k determines the number of latent codes used to represent a paragraph and therefore controls the compression level. Latent codes with smaller values of k are easier to model using the diffusion model, but may struggle to accurately preserve all the information in the original text. Additionally, smaller values of k offer computational efficiency as the sequence length for the diffusion model is k. To determine the best set of latent codes, we conducted experiments using three different methods: 1) selecting the first k hidden vectors, 2) selecting the last k hidden vectors, and 3) selecting interleaving hidden vectors, one for every L k hidden vectors. The results of the ablation study are presented in Table 5. Based on our findings, we observed no significant difference among the different choices, so we opted for option 1). Furthermore, we discovered that increasing the value of k does not lead to a dramatic improvement in performance. To balance between efficiency and performance, in most of our study we only use k =16 Setup BLEU_clean BLEU_robust First k (k=16) 79.59 43.17 A.3 Reconstruction, denoising and interpolation examples In Table 6, we present examples that demonstrate the adeptness of the trained Variational Paragraph Embedder in providing clean and denoised reconstructions. Additionally, we showcase interpolation results (Table 7, 8) derived from two random sentences in the hotel review dataset. The interpolated paragraph is usually coherent and incorporates inputs from both sentences, characterizing the distributional smoothness of the latent space. Reconstructed text complaints: after two nights stay, i asked the maid to clean our room (empty the wastebasket & make the bed). Denoising reconstruction (hotel review), noise level 0.3 Original text * * * check out the bathroom picture * * * i was in nyc by myself to watch some friends participate in the us olympic marathon trials. Corrupted text * * [unused697] check exams the bathroom picture * * slams i was in nyc mead myself yankee 2016 some scotch ruin in the outfielder olympicnca trials.

artificial intelligence, hotel, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia (0.93)
North America > United States > Maryland > Prince George's County (0.28)

Genre: Research Report > New Finding (0.86)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Consumer Products & Services (1.00)
Health & Medicine (0.93)
(6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

fd02779b6c8885efc69bab6dd9571cee-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:56:06 GMT

artificial intelligence, iteration, machine learning, (19 more...)

Neural Information Processing Systems

Country: Asia > China (0.46)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

fc8ee7c7ab5b5f6b1615045dfb617ed6-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:54:31 GMT

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Asia > China (0.28)
Europe > France (0.28)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.93)

Add feedback

fc657b7fd7b9aaa462f2ef9f0362b273-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:53:41 GMT

artificial intelligence, machine learning, representation, (17 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre:

Research Report (0.67)
Overview (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Global Structure-Aware Diffusion Process for Low-Light Image Enhancement

Neural Information Processing SystemsApr-30-2026, 09:41:04 GMT

This paper studies a diffusion-based framework to address the low-light image enhancement problem.

artificial intelligence, machine learning, survey article, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre:

Research Report (0.66)
Overview (0.48)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

fba4a59c7a569fce120eea9aa9227052-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 09:39:40 GMT

data mining, machine learning, node, (15 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > United States > California (0.92)
Asia (0.68)

Genre: Research Report (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.93)

Add feedback

RangePerception: Taming LiDARRange View for Efficient and Accurate 3DObject Detection

Neural Information Processing SystemsApr-30-2026, 09:28:31 GMT

LiDAR-based 3D detection methods currently use bird's-eye view (BEV) or range view (RV) as their primary basis. The former relies on voxelization and 3D convolutions, resulting in inefficient training and inference processes. Conversely, RV-based methods demonstrate higher efficiency due to their compactness and compatibility with 2D convolutions, but their performance still trails behind that of BEV-based methods. To eliminate this performance gap while preserving the efficiency of RV-based methods, this study presents an efficient and accurate RV-based 3D object detection framework termed RangePerception. Through meticulous analysis, this study identifies two critical challenges impeding the performance of existing RV-based methods: 1) there exists a natural domain gap between the 3D world coordinate used in output and 2D range image coordinate used in input, generating difficulty in information extraction from range images; 2) native range images suffer from vision corruption issue, affecting the detection accuracy of the objects located on the margins of the range images. To address the key challenges above, we propose two novel algorithms named Range Aware Kernel (RAK) and Vision Restoration Module (VRM), which facilitate information flow from range image representation and world-coordinate 3D detection results. With the help of RAK and VRM, our RangePerception achieves 3.25/4.18

artificial intelligence, machine learning, rangeperception, (12 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Filters

Collaborating Authors

Asia

Reconstructing the Image Stitching Pipeline: Integrating Fusion and Rectangling into a Unified Inpainting Model

Was Israeli PM's Lebanon destruction video a snub to Trump?

Humanoid robots being trialled as airport workers in Japan

Appendix AVariational Paragraph Embedder A.1 Selection of substitution rate p

fd02779b6c8885efc69bab6dd9571cee-Paper-Conference.pdf

fc8ee7c7ab5b5f6b1615045dfb617ed6-Paper-Conference.pdf

fc657b7fd7b9aaa462f2ef9f0362b273-Paper-Conference.pdf

Global Structure-Aware Diffusion Process for Low-Light Image Enhancement

fba4a59c7a569fce120eea9aa9227052-Paper-Conference.pdf

RangePerception: Taming LiDARRange View for Efficient and Accurate 3DObject Detection