cushion
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Katsumata, Kei, Kambara, Motonari, Yashima, Daichi, Korekata, Ryosuke, Sugiura, Komei
Abstract-- We consider the problem of generating free-form mobile manipulation instructions based on a target object image and receptacle image. Conventional image captioning models are not able to generate appropriate instructions because their architectures are typically optimized for single-image. In this study, we propose a model that handles both the target object and receptacle to generate free-form instruction sentences for mobile manipulation tasks. Moreover, we introduce a novel training method that effectively incorporates the scores from both learning-based and n-gram based automatic evaluation metrics as rewards. This method enables the model to learn the co-occurrence relationships between words and appropriate paraphrases. Therefore, models are required to appropriately handle both images. Hence, these methods are inappropriate essential in a variety of contexts such as elderly care facilities for generating mobile manipulation instructions based on and daily support for disabilities. In particular, the integration multiple images. of service robots in elderly care facilities significantly We propose a model that generates mobile manipulation reduces the burden on caregivers and addresses the growing instructions using a target object image and a receptacle demand driven by the rise in the elderly population.
Apple Vision Pro hands-on, redux: Immersive Video, Disney app, floating keyboard, and a little screaming
With pre-orders for the Apple Vision Pro headset opening this week, the company is getting ready to launch one of its most significant products ever. It announced this morning an "entertainment format pioneered by Apple" called Apple Immersive Video, as well as new viewing environments in the Disney app featuring scenes from the studio's beloved franchises like the Avengers and Star Wars. We already got hands-on once back at WWDC when the headset was first announced, but two of our editors, Dana Wollman and Cherlynn Low, had a chance to go back and revisit the device (and Dana's case, experience it anew). Since we've already walked you through some of the basic UI elements in our earlier piece, we decided to focus on some of the more recently added features, including Apple Immersive Video, the new Disney environments, a built-in "Encounter Dinosaurs" experience, as well as the floating keyboard, which didn't work for us when we first tried the device in June of last year. Here, too, we wanted to really get at what it actually feels like to use the device, from the frustrating to the joyful to the unintentionally eerie.
- Leisure & Entertainment (0.89)
- Media > Film (0.35)
CES 2024 Day 1 recap: Samsung and Sony dominated, as did chips and laptops
The first unofficial day of CES 2024 has come and gone and it feels like we've been run over by a giant metaphorical eighteen-wheeler full of press conferences. From home robots to electric vehicles to AI, laptops and processors, there was news from pretty much all areas of tech. There were pleasant surprises like Samsung's cute new Ballie robot ball and Sony's spatial content creation headset, and intriguing concepts like Razer's vibrating cushion for gamers. We also got exactly what we expected in the form of new processors from the likes of AMD, Intel and NVIDIA, as well as the subsequent flood of laptops carrying the just-announced chips for 2024. And for everyone else, this CES also saw the launch of things like headphones, electric vehicles, gaming handhelds, grills, gaming phones, e-ink tablets, strange hybrid devices, noise-suppressing masks, standing desks and more.
- Transportation > Ground > Road (1.00)
- Transportation > Electric Vehicle (1.00)
Hugging this pulsating cushion apparently suppresses your anxiety
Yukai Engineering, the team behind the strangely adorable cat tail pillow, is back with a new quirky invention. Unveiled at CES 2023, the Fufuly is yet another anxiety-reducing cushion from the Japanese company, but this time featuring a gentle rhythmical pulsation as the main therapeutic tool. The idea is that when you're hugging a Fufuly, its life-like behavior stimulates your belly to induce slower and deeper breathing. Despite its thought-bubble shape (supposedly to evoke the image of a puff of air), the cushion felt more like a mellow creature dozing off in my arms. I even mistook the quiet mechanical noise as a cat's purr, to which CEO Shunsuke Aoki assured me this was purely a coincidence.
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation
Tatiya, Gyan, Francis, Jonathan, Bondi, Luca, Navarro, Ingrid, Nyberg, Eric, Sinapov, Jivko, Oh, Jean
Generalisation to unseen contexts remains a challenge for embodied navigation agents. In the context of semantic audio-visual navigation (SAVi) tasks, the notion of generalisation should include both generalising to unseen indoor visual scenes as well as generalising to unheard sounding objects. However, previous SAVi task definitions do not include evaluation conditions on truly novel sounding objects, resorting instead to evaluating agents on unheard sound clips of known objects; meanwhile, previous SAVi methods do not include explicit mechanisms for incorporating domain knowledge about object and region semantics. These weaknesses limit the development and assessment of models' abilities to generalise their learned experience. In this work, we introduce the use of knowledge-driven scene priors in the semantic audio-visual embodied navigation task: we combine semantic information from our novel knowledge graph that encodes object-region relations, spatial knowledge from dual Graph Encoder Networks, and background knowledge from a series of pre-training tasks -- all within a reinforcement learning framework for audio-visual navigation. We also define a new audio-visual navigation sub-task, where agents are evaluated on novel sounding objects, as opposed to unheard clips of known objects. We show improvements over strong baselines in generalisation to unseen regions and novel sounding objects, within the Habitat-Matterport3D simulation environment, under the SoundSpaces task.
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (10 more...)
Bowers & Wilkins Px7 S2 review: A lot of upgrades at no extra cost
Bowers & Wilkins might be a name many associate with high-end home audio gear, but the company has been steadily chugging along with headphones too. Its latest model, the Px7 S2 ($399), is a completely overhauled version of the PX that debuted in 2019. At every turn, this new model is a worthy upgrade over its predecessor, and you won't have to pay more for the improvements either. For the Px7 S2, Bowers & Wilkins borrowed elements of both the original PX and the Px7 that contribute to the refined look. The company also slimmed down the overall shape and opted for more cushion in the earpads – all while trimming the overall weight.
'Stray' preview: Because you're a cat
Every gameplay mechanic and design decision in Stray is driven by a single idea: Because you're a cat. The world of Stray is filled with anthropomorphic robots, futuristic Hong Kong-inspired streets and makeshift skyscrapers built on heaping piles of trash, and it all serves as a playground for the protagonist, an orange tabby on a mission to escape the city and reunite with its family. And knock over as many delicate objects as possible, of course. In a hands-off preview event for Stray, producer Swann Martin-Raget of BlueTwelve Studio consistently repeated the phrase, "because you're a cat," justifying his decisions to topple various items, jump to precarious ledges and curl up to sleep on top of buildings. "It is required to scratch every wall and sofa because you're a cat and that's very, very important," he said, the orange tabby digging its claws into a random robot's couch cushion. Set pieces that would be afterthoughts in games like Deus Ex: Mankind Divided or Cyberpunk 2077 are integral to Stray's gameplay.
World's first luxury sports hovercraft is revealed
The hovercraft concept is not a new one, having floated drivers across land, sand and water since the 1950s. But 70 years on, now petrol heads finally have a reason to celebrate the vehicle, because what has been billed as the world's first ever luxury sports hovercraft is set to go on sale for $100,000 (£73,400). It may not be as quick as a supercar, with a top speed of around 60mph and 120 miles of range at 40 mph, but it's the fastest amphibious vehicle yet. And unlike a Ferrari, McLaren or Bugatti, this exciting new roadster travels on a cushion of air seven inches off the ground, allowing drivers to whizz across water and giving new meaning to the term off-road vehicle. Designed by the firm Von Mercier, which is named after British engineer and founder Michael Mercier, the Arosa is said to'blend cutting-edge hovercraft and electric vehicle innovation'.
- North America > United States (0.15)
- Oceania > Tonga (0.05)
- Europe > United Kingdom > England > West Sussex (0.05)
- Transportation > Passenger (1.00)
- Transportation > Marine (1.00)
- Transportation > Air (1.00)
- Transportation > Ground > Road (0.54)
A Robot That Finds Your Lost Stuff and More AI-Enabled Gadgets to Come
Researchers at Stanford University have developed a prototype toilet that uses an artificial intelligence-trained camera to track the form of feces and monitor the color and flow of urine. A "lab-on-a-chip" device built into the toilet will analyze micro stool samples to detect viruses like Covid-19 and blood, says Seung-min Park, the lead researcher on the project. This digital diary could yield valuable health insights and facilitate early, noninvasive diagnosis of irritable bowel syndrome or colorectal cancer, Dr. Park says. A look at how innovation and technology are transforming the way we live, work and play. An app would allow users to track health parameters.
Compressing Heavy-Tailed Weight Matrices for Non-Vacuous Generalization Bounds
Heavy-tailed distributions have been studied in statistics, random matrix theory, physics, and econometrics as models of correlated systems, among other domains. Further, heavy-tail distributed eigenvalues of the covariance matrix of the weight matrices in neural networks have been shown to empirically correlate with test set accuracy in several works (e.g. arXiv:1901.08276), but a formal relationship between heavy-tail distributed parameters and generalization bounds was yet to be demonstrated. In this work, the compression framework of arXiv:1802.05296 is utilized to show that matrices with heavy-tail distributed matrix elements can be compressed, resulting in networks with sparse weight matrices. Since the parameter count has been reduced to a sum of the non-zero elements of sparse matrices, the compression framework allows us to bound the generalization gap of the resulting compressed network with a non-vacuous generalization bound. Further, the action of these matrices on a vector is discussed, and how they may relate to compression and resilient classification is analyzed.
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)