refraction
MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation
Wu, Jiayi, Lin, Xiaomin, Negahdaripour, Shahriar, Fermüller, Cornelia, Aloimonos, Yiannis
Tasks such as autonomous navigation, 3D reconstruction, and object recognition near the water surfaces are crucial in marine robotics applications. However, challenges arise due to dynamic disturbances, e.g., light reflections and refraction from the random air-water interface, irregular liquid flow, and similar factors, which can lead to potential failures in perception and navigation systems. Traditional computer vision algorithms struggle to differentiate between real and virtual image regions, significantly complicating tasks. A virtual image region is an apparent representation formed by the redirection of light rays, typically through reflection or refraction, creating the illusion of an object's presence without its actual physical location. This work proposes a novel approach for segmentation on real and virtual image regions, exploiting synthetic images combined with domain-invariant information, a Motion Entropy Kernel, and Epipolar Geometric Consistency. Our segmentation network does not need to be re-trained if the domain changes. We show this by deploying the same segmentation network in two different domains: simulation and the real world. By creating realistic synthetic images that mimic the complexities of the water surface, we provide fine-grained training data for our network (MARVIS) to discern between real and virtual images effectively. By motion & geometry-aware design choices and through comprehensive experimental analysis, we achieve state-of-the-art real-virtual image segmentation performance in unseen real world domain, achieving an IoU over 78% and a F1-Score over 86% while ensuring a small computational footprint. MARVIS offers over 43 FPS (8 FPS) inference rates on a single GPU (CPU core). Our code and dataset are available here https://github.com/jiayi-wu-umd/MARVIS.
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- North America > United States > Florida > Miami-Dade County > Coral Gables (0.04)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Natural Language (0.88)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Export Reviews, Discussions, Author Feedback and Meta-Reviews
SUMMARY Hamiltonian MCMC methods sample from a probability distribution by treating its log as a "potential energy" function over the state space, augmenting the space with extra "momentum variables" and their associated "kinetic energy", and evolving the state of the Markov process by integrating the physical Hamiltonian equations of motion of the system. Each step of the Markov chain is accomplished by numerically integrating the Hamiltonian equations forward in time. However, if the energy function is non-differentiable, the integral is not well-defined. The rejection step that is used to counteract numerical inaccuracies in the integration also accounts for such non-differentiable regions, but at the cost of slowing down the mixing rate of the Markov chain. This paper suggests physically-inspired "reflections" and "refractions" of the trajectory of the system that occur whenever the state crosses a discontinuity in the energy function. It applies to target distributions that are differentiable everywhere except on the boundaries of certain polytopes; the reflection or refraction occurs whenever the trajectory of the system crosses such a boundary.
Reflection, Refraction, and Hamiltonian Monte Carlo
Hamiltonian Monte Carlo (HMC) is a successful approach for sampling from continuous densities. However, it has difficulty simulating Hamiltonian dynamics with non-smooth functions, leading to poor performance. This paper is motivated by the behavior of Hamiltonian dynamics in physical systems like optics. We introduce a modification of the Leapfrog discretization of Hamiltonian dynamics on piecewise continuous energies, where intersections of the trajectory with discontinuities are detected, and the momentum is reflected or refracted to compensate for the change in energy. We prove that this method preserves the correct stationary distribution when boundaries are affine. Experiments show that by reducing the number of rejected samples, this method improves on traditional HMC.
UWA360CAM: A 360$^{\circ}$ 24/7 Real-Time Streaming Camera System for Underwater Applications
Pham, Quan-Dung, Zhu, Yipeng, Ha, Tan-Sang, Nguyen, K. H. Long, Hua, Binh-Son, Yeung, Sai-Kit
Omnidirectional camera is a cost-effective and information-rich sensor highly suitable for many marine applications and the ocean scientific community, encompassing several domains such as augmented reality, mapping, motion estimation, visual surveillance, and simultaneous localization and mapping. However, designing and constructing such a high-quality 360$^{\circ}$ real-time streaming camera system for underwater applications is a challenging problem due to the technical complexity in several aspects including sensor resolution, wide field of view, power supply, optical design, system calibration, and overheating management. This paper presents a novel and comprehensive system that addresses the complexities associated with the design, construction, and implementation of a fully functional 360$^{\circ}$ real-time streaming camera system specifically tailored for underwater environments. Our proposed system, UWA360CAM, can stream video in real time, operate in 24/7, and capture 360$^{\circ}$ underwater panorama images. Notably, our work is the pioneering effort in providing a detailed and replicable account of this system. The experiments provide a comprehensive analysis of our proposed system.
A Lagrangian Formulation For Optical Backpropagation Training In Kerr-Type Optical Networks
A training method based on a form of continuous spatially distributed optical error back-propagation is presented for an all optical network composed of nondiscrete neurons and weighted interconnections. The all optical network is feed-forward and is composed of thin layers of a Kerr(cid:173) type self focusing/defocusing nonlinear optical material. The training method is derived from a Lagrangian formulation of the constrained minimization of the network error at the output. This leads to a formulation that describes training as a calculation of the distributed error of the optical signal at the output which is then reflected back through the device to assign a spatially distributed error to the internal layers. This error is then used to modify the internal weighting values.
An Additive Latent Feature Model for Transparent Object Recognition
Existing methods for recognition of object instances and categories based on quantized local features can perform poorly when local features exist on transparent surfaces, such as glass or plastic objects. There are characteristic patterns to the local appearance of transparent objects, but they may not be well captured by distances to individual examples or by a local pattern codebook obtained by vector quantization. The appearance of a transparent patch is determined in part by the refraction of a background pattern through a transparent medium: the energy from the background usually dominates the patch appearance. We model transparent local patch appearance using an additive model of latent factors: background factors due to scene content, and factors which capture a local edge energy distribution characteristic of the refraction. We implement our method using a novel LDA-SIFT formulation which performs LDA prior to any vector quantization step; we discover latent topics which are characteristic of particular transparent patches and quantize the SIFT space into transparent visual words according to the latent topic dimensions.
Sampling Neural Radiance Fields for Refractive Objects
Pan, Jen-I, Su, Jheng-Wei, Hsiao, Kai-Wen, Yen, Ting-Yu, Chu, Hung-Kuo
Recently, differentiable volume rendering in neural radiance fields (NeRF) has gained a lot of popularity, and its variants have attained many impressive results. However, existing methods usually assume the scene is a homogeneous volume so that a ray is cast along the straight path. In this work, the scene is instead a heterogeneous volume with a piecewise-constant refractive index, where the path will be curved if it intersects the different refractive indices. For novel view synthesis of refractive objects, our NeRF-based framework aims to optimize the radiance fields of bounded volume and boundary from multi-view posed images with refractive object silhouettes. To tackle this challenging problem, the refractive index of a scene is reconstructed from silhouettes. Given the refractive index, we extend the stratified and hierarchical sampling techniques in NeRF to allow drawing samples along a curved path tracked by the Eikonal equation. The results indicate that our framework outperforms the state-of-the-art method both quantitatively and qualitatively, demonstrating better performance on the perceptual similarity metric and an apparent improvement in the rendering quality on several synthetic and real scenes.
- Asia > Taiwan (0.05)
- Asia > South Korea > Daegu > Daegu (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
Delivery startup Refraction AI raises $4.2M to expand service areas
Refraction AI, a company developing semi-autonomous delivery robots, today announced that it raised $4.2 million in seed funding led by Pillar VC. Refraction says that the proceeds will be used for customer acquisition, geographic expansion, and product development well into the next year. The worsening COVID-19 health crisis in much of the U.S. seems likely to hasten the adoption of self-guided robots and drones for goods transportation. They require disinfection, which companies like Kiwibot, Starship Technologies, and Postmates are conducting manually with sanitation teams. But in some cases, delivery rovers like Refraction's could minimize the risk of spreading disease.
Council Post: Connecting Everything For The Ultimate Experience
Lately, I feel like my home-office chair doesn't understand me, which is disappointing given all the time we're spending hunkered down together. If only my chair was aware of my current situation: endless hours of videoconferencing, limited exercise, fitful sleep, a stiff back. Any time our regular routine undergoes a major disruption, everything feels off-kilter. Working from home is a big change for many of us, but opportunities exist for new experiences enabled by an array of increasingly smart devices. What if these devices could sense something was out of balance and collaborated on how to make us feel better?
- Information Technology > Communications > Networks (1.00)
- Information Technology > Artificial Intelligence (0.72)