Media
Both of these influencers are successful - but only one is human
In some ways, Gigi is like any other young social media influencer. With perfect hair and makeup, she logs on and talks to her fans. She shares clips: eating, doing skin care, putting on lipstick. She even has a cute baby who appears in some videos. But after a few seconds, something may seem a little off.
The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes
Estimating camera motion in deformable scenes poses a complex and open research challenge. Most existing non-rigid structure from motion techniques assume to observe also static scene parts besides deforming scene parts in order to establish an anchoring reference. However, this assumption does not hold true in certain relevant application cases such as endoscopies. Deformable odometry and SLAM pipelines, which tackle the most challenging scenario of exploratory trajectories, suffer from a lack of robustness and proper quantitative evaluation methodologies. To tackle this issue with a common benchmark, we introduce the Drunkard's Dataset, a challenging collection of synthetic data targeting visual navigation and reconstruction in deformable environments. This dataset is the first large set of exploratory camera trajectories with ground truth inside 3D scenes where every surface exhibits non-rigid deformations over time. Simulations in realistic 3D buildings lets us obtain a vast amount of data and ground truth labels, including camera poses, RGB images and depth, optical flow and normal maps at high resolution and quality. We further present a novel deformable odometry method, dubbed the Drunkard's Odometry, which decomposes optical flow estimates into rigid-body camera motion and non-rigid scene deformations. In order to validate our data, our work contains an evaluation of several baselines as well as a novel tracking error metric which does not require ground truth data.
Simple and Controllable Music Generation
We tackle the task of conditional music generation. We introduce MusicGen, a single Language Model (LM) that operates over several streams of compressed discrete music representation, i.e., tokens. Unlike prior work, MusicGen is comprised of a single-stage transformer LM together with efficient token interleaving patterns, which eliminates the need for cascading several models, e.g., hierarchically or upsampling. Following this approach, we demonstrate how MusicGen can generate high-quality samples, both mono and stereo, while being conditioned on textual description or melodic features, allowing better controls over the generated output. We conduct extensive empirical evaluation, considering both automatic and human studies, showing the proposed approach is superior to the evaluated baselines on a standard text-to-music benchmark. Through ablation studies, we shed light over the importance of each of the components comprising MusicGen.
Tiny wild cat spotted in Thailand for first time in 30 years
The flat-headed felines are the smallest wild cats in Southeast Asia. New images from Thailand's DNP and Panthera prove the existence and rediscovery of one of the world's most Endangered and least known wild cats, the flat-headed cat, in Thailand's Princess Sirindhorn Wildlife Sanctuary. Breakthroughs, discoveries, and DIY tips sent every weekday. Camera traps in Thailand have captured adorable passersby with significant implication for the country's conservation efforts. While these furry creatures might look like your average house cat, they're actually wild flat-headed cats ().