Goto

Collaborating Authors

 noodle


Posha vs. Thermomix: Kitchen Robots Face Off on Thanksgiving Sides

WIRED

The Posha and the Thermomix TM7 are the closest things to a home robot chef that mere mortals can afford. The catch is that you're the prep cook. The holiday is still almost a week away, and I'm sick of Thanksgiving. I've already made four rounds of mashed potatoes, three of mac and cheese, and three turkeys (with more still waiting in my fridge) as part of testing smart probes to help smoke turkeys outside and preparing seven-course holiday meal kits for friends and family. I was eager to finally outsource some of the cooking by testing two very different robo-chef devices, the Thermomix TM7 and the Posha kitchen robot . Both promise to plan my meals and also do most of the cooking, which sounds pretty good to me. The Thermomix descends from a German device launched in 1968--a time when the best-known robot chef was cartoon Rosie on --that was essentially a blender with a heater. It's since caught on big in countries from Italy to Portugal to Australia, and over the years it's added multi-tier steaming, baking, proofing, a touchscreen, an encyclopedic recipe app, and a whole lot of smart features.


Tackling the Noisy Elephant in the Room: Label Noise-robust Out-of-Distribution Detection via Loss Correction and Low-rank Decomposition

Azad, Tarhib Al, Ibrahim, Shahana

arXiv.org Artificial Intelligence

Robust out-of-distribution (OOD) detection is an indispensable component of modern artificial intelligence (AI) systems, especially in safety-critical applications where models must identify inputs from unfamiliar classes not seen during training. While OOD detection has been extensively studied in the machine learning literature--with both post hoc and training-based approaches--its effectiveness under noisy training labels remains underexplored. Recent studies suggest that label noise can significantly degrade OOD performance, yet principled solutions to this issue are lacking. In this work, we demonstrate that directly combining existing label noise-robust methods with OOD detection strategies is insufficient to address this critical challenge. To overcome this, we propose a robust OOD detection framework that integrates loss correction techniques from the noisy label learning literature with low-rank and sparse decomposition methods from signal processing. Extensive experiments on both synthetic and real-world datasets demonstrate that our method significantly outperforms the state-of-the-art OOD detection techniques, particularly under severe noisy label settings.


FLAIR: Feeding via Long-horizon AcquIsition of Realistic dishes

Jenamani, Rajat Kumar, Sundaresan, Priya, Sakr, Maram, Bhattacharjee, Tapomayukh, Sadigh, Dorsa

arXiv.org Artificial Intelligence

Robot-assisted feeding has the potential to improve the quality of life for individuals with mobility limitations who are unable to feed themselves independently. However, there exists a large gap between the homogeneous, curated plates existing feeding systems can handle, and truly in-the-wild meals. Feeding realistic plates is immensely challenging due to the sheer range of food items that a robot may encounter, each requiring specialized manipulation strategies which must be sequenced over a long horizon to feed an entire meal. An assistive feeding system should not only be able to sequence different strategies efficiently in order to feed an entire meal, but also be mindful of user preferences given the personalized nature of the task. We address this with FLAIR, a system for long-horizon feeding which leverages the commonsense and few-shot reasoning capabilities of foundation models, along with a library of parameterized skills, to plan and execute user-preferred and efficient bite sequences. In real-world evaluations across 6 realistic plates, we find that FLAIR can effectively tap into a varied library of skills for efficient food pickup, while adhering to the diverse preferences of 42 participants without mobility limitations as evaluated in a user study. We demonstrate the seamless integration of FLAIR with existing bite transfer methods [19, 28], and deploy it across 2 institutions and 3 robots, illustrating its adaptability. Finally, we illustrate the real-world efficacy of our system by successfully feeding a care recipient with severe mobility limitations. Supplementary materials and videos can be found at: https://emprise.cs.cornell.edu/flair .


Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep Learning

Vishwakarma, Rahul, Rezaei, Amin

arXiv.org Artificial Intelligence

The risk of hardware Trojans being inserted at various stages of chip production has increased in a zero-trust fabless era. To counter this, various machine learning solutions have been developed for the detection of hardware Trojans. While most of the focus has been on either a statistical or deep learning approach, the limited number of Trojan-infected benchmarks affects the detection accuracy and restricts the possibility of detecting zero-day Trojans. To close the gap, we first employ generative adversarial networks to amplify our data in two alternative representation modalities, a graph and a tabular, ensuring that the dataset is distributed in a representative manner. Further, we propose a multimodal deep learning approach to detect hardware Trojans and evaluate the results from both early fusion and late fusion strategies. We also estimate the uncertainty quantification metrics of each prediction for risk-aware decision-making. The outcomes not only confirms the efficacy of our proposed hardware Trojan detection method but also opens a new door for future studies employing multimodality and uncertainty quantification to address other hardware security challenges.


Prompt Based Tri-Channel Graph Convolution Neural Network for Aspect Sentiment Triplet Extraction

Peng, Kun, Jiang, Lei, Peng, Hao, Liu, Rui, Yu, Zhengtao, Ren, Jiaqian, Hao, Zhifeng, Yu, Philip S.

arXiv.org Artificial Intelligence

Aspect Sentiment Triplet Extraction (ASTE) is an emerging task to extract a given sentence's triplets, which consist of aspects, opinions, and sentiments. Recent studies tend to address this task with a table-filling paradigm, wherein word relations are encoded in a two-dimensional table, and the process involves clarifying all the individual cells to extract triples. However, these studies ignore the deep interaction between neighbor cells, which we find quite helpful for accurate extraction. To this end, we propose a novel model for the ASTE task, called Prompt-based Tri-Channel Graph Convolution Neural Network (PT-GCN), which converts the relation table into a graph to explore more comprehensive relational information. Specifically, we treat the original table cells as nodes and utilize a prompt attention score computation module to determine the edges' weights. This enables us to construct a target-aware grid-like graph to enhance the overall extraction process. After that, a triple-channel convolution module is conducted to extract precise sentiment knowledge. Extensive experiments on the benchmark datasets show that our model achieves state-of-the-art performance. The code is available at https://github.com/KunPunCN/PT-GCN.


Revealed: The best pasta shape for holding sauce - so, how does your favourite stack up?

Daily Mail - Science & tech

With its simple mix of ingredients and high nutritional value, it's no surprise pasta is one of the most popular foods in the world. Despite dating back thousands of years, the age-old question still remains – which pasta shape is the best for holding sauce? To mark World Pasta Day, MailOnline turned to online AI tool ChatGPT for the answer, and it came up with some controversial results. Top of the list was cascatelli, a relatively new pasta from America with a curved shape and distinctive ruffles, deliberately designed to carry sauce. Also in the top six were spaghetti, penne and the'bow tie' pasta farfalle – but an expert claims a lot depends on the type of sauce too.


Learning Sequential Acquisition Policies for Robot-Assisted Feeding

Sundaresan, Priya, Wu, Jiajun, Sadigh, Dorsa

arXiv.org Artificial Intelligence

A robot providing mealtime assistance must perform specialized maneuvers with various utensils in order to pick up and feed a range of food items. Beyond these dexterous low-level skills, an assistive robot must also plan these strategies in sequence over a long horizon to clear a plate and complete a meal. Previous methods in robot-assisted feeding introduce highly specialized primitives for food handling without a means to compose them together. Meanwhile, existing approaches to long-horizon manipulation lack the flexibility to embed highly specialized primitives into their frameworks. We propose Visual Action Planning OveR Sequences (VAPORS), a framework for long-horizon food acquisition. VAPORS learns a policy for high-level action selection by leveraging learned latent plate dynamics in simulation. To carry out sequential plans in the real world, VAPORS delegates action execution to visually parameterized primitives. We validate our approach on complex real-world acquisition trials involving noodle acquisition and bimanual scooping of jelly beans. Across 38 plates, VAPORS acquires much more efficiently than baselines, generalizes across realistic plate variations such as toppings and sauces, and qualitatively appeals to user feeding preferences in a survey conducted across 49 individuals. Code, datasets, videos, and supplementary materials can be found on our website: https://sites.google.com/view/vaporsbot.


Vec2Vec: A Compact Neural Network Approach for Transforming Text Embeddings with High Fidelity

Gao, Andrew Kean

arXiv.org Artificial Intelligence

Vector embeddings have become ubiquitous tools for many language-related tasks. A leading embedding model is OpenAI's text-ada-002 which can embed approximately 6,000 words into a 1,536-dimensional vector. While powerful, text-ada-002 is not open source and is only available via API. We trained a simple neural network to convert open-source 768-dimensional MPNet embeddings into text-ada-002 embeddings. We compiled a subset of 50,000 online food reviews. We calculated MPNet and text-ada-002 embeddings for each review and trained a simple neural network to for 75 epochs. The neural network was designed to predict the corresponding text-ada-002 embedding for a given MPNET embedding. Our model achieved an average cosine similarity of 0.932 on 10,000 unseen reviews in our held-out test dataset. We manually assessed the quality of our predicted embeddings for vector search over text-ada-002-embedded reviews. While not as good as real text-ada-002 embeddings, predicted embeddings were able to retrieve highly relevant reviews. Our final model, Vec2Vec, is lightweight (<80 MB) and fast. Future steps include training a neural network with a more sophisticated architecture and a larger dataset of paired embeddings to achieve greater performance. The ability to convert between and align embedding spaces may be helpful for interoperability, limiting dependence on proprietary models, protecting data privacy, reducing costs, and offline operations.


Beijing introduces the world to 'robo-noodles' to limit COVID spread during the Olympics

FOX News

Gordon Ramsay may not be invited to the Olympics this year. Beijing is focusing on robotic cooks and servers to prepare and serve food to the attendees in the city's Winter Olympics Main Media Center to minimize the spread of COVID-19 and help maximize efficacy, according to a recent Food & Wine report. "The intelligent meal preparation and meal service system here can not only improve the efficiency of meal supply, but also save manpower to the maximum extent and avoid excessive human interaction in the context of epidemic prevention and control," the state-run Xinhua News Agency said. A worker grabs food delivered to a table robotically in the media dining area of the main media center ahead of the 2022 Winter Olympics, Wednesday, Feb. 2, 2022, in Beijing. "The media restaurant will operate 24 hours a day during the competition, providing various dining options such as Chinese food, Western food, and fast food."


Nicolas Babin disruptive week about Artificial Intelligence - August 30th 2021 - Babin Business Consulting

#artificialintelligence

I am regularly asked to summarize my many posts. I thought it would be a good idea to publish on this blog, every Monday, some of the most relevant articles that I have already shared with you on my social networks. Today I will share some of the most relevant articles about Artificial Intelligence and in what form you can find it in today's life. I will also comment on the articles. Can artificial intelligence help scientists spot gravitational waves?