Goto

Collaborating Authors

 image reconstruction


Center Smoothing: Certified Robustness for Networks with Structured Outputs Appendix

Neural Information Processing Systems

Let, y be a point in that intersection. Since, by definition, ห†r(x0,) is the radius of the smallest ball with 1/2 + probability mass of f(x0 + P) over all possible centers in Rk and ห†Ris the radius of the smallest such ball centered at ห†f(x), we must have ห†r(x0,) ห†R. Consider the smallest ball B(z0,ห†r(x, 1)) that encloses at least 1/2 + 1 probability mass of f(x+ P). Since, r is the radius of the minimum enclosing ball that contains at least half of the points in Z, we have r ห†r(x, 1). Now, using the definition of ห†Rand following the same reasoning as theorem 2, we can say that, d( ห†f(x), ห†f(x0)) ฮฒห†r(x0,) + ห†R (1 + ฮฒ) ห†R.


Focus On What Matters: Separated Models For Visual-Based RL Generalization

Neural Information Processing Systems

A primary challenge for visual-based Reinforcement Learning (RL) is to generalize effectively across unseen environments. Although previous studies have explored different auxiliary tasks to enhance generalization, few adopt image reconstruction due to concerns about exacerbating overfitting to task-irrelevant features during training. Perceiving the pre-eminence of image reconstruction in representation learning, we propose SMG (\blue{S}eparated \blue{M}odels for \blue{G}eneralization), a novel approach that exploits image reconstruction for generalization. SMG introduces two model branches to extract task-relevant and task-irrelevant representations separately from visual observations via cooperatively reconstruction. Built upon this architecture, we further emphasize the importance of task-relevant features for generalization. Specifically, SMG incorporates two additional consistency losses to guide the agent's focus toward task-relevant areas across different scenarios, thereby achieving free from overfitting. Extensive experiments in DMC demonstrate the SOTA performance of SMG in generalization, particularly excelling in video-background settings. Evaluations on robotic manipulation tasks further confirm the robustness of SMG in real-world applications.






A Appendix

Neural Information Processing Systems

KAN oversaw the project and contributed valuable feedback. MindEye was developed using a training and validation set of Subject 1's data, with the test set (and other subjects' data) untouched until final PyTorch code for the MLP backbone and projector is depicted in Algorithm 1. Specifics on how we DALL-E 2. This makes our prior much faster at inference time. For simplicity we use bidirectional attention in our final model. To map to Stable Diffusion's V AE latent space we use a low-level pipeline with the same architecture as the high level pipeline. Recent works in low-level vision (super-resolution, denoising, deblurring, etc.) have observed that This performs worse than only applying the loss in latent space and also requires significantly more GPU memory.




f-DivergenceVariationalInference

Neural Information Processing Systems

For decades, the dominant paradigm for approximate Bayesian inferencep(z|x) = p(z,x)/p(x) has been Markov-Chain Monte-Carlo (MCMC) algorithms, which estimate the evidencep(x) = R p(z,x)dz via sampling. However, since sampling tends to be a slow and computationally intensive process, these sampling-based approximate inference methods fadewhendealing withthemodern probabilistic machine learning problems that usually involveverycomplexmodels, high-dimensional feature spaces andlargedatasets.