pasta
PASTA: A Unified Framework for Offline Assortment Learning
Dong, Juncheng, Mo, Weibin, Qi, Zhengling, Shi, Cong, Fang, Ethan X., Tarokh, Vahid
We study a broad class of assortment optimization problems in an offline and data-driven setting. In such problems, a firm lacks prior knowledge of the underlying choice model, and aims to determine an optimal assortment based on historical customer choice data. The combinatorial nature of assortment optimization often results in insufficient data coverage, posing a significant challenge in designing provably effective solutions. To address this, we introduce a novel Pessimistic Assortment Optimization (PASTA) framework that leverages the principle of pessimism to achieve optimal expected revenue under general choice models. Notably, PASTA requires only that the offline data distribution contains an optimal assortment, rather than providing the full coverage of all feasible assortments. Theoretically, we establish the first finite-sample regret bounds for offline assortment optimization across several widely used choice models, including the multinomial logit and nested logit models. Additionally, we derive a minimax regret lower bound, proving that PASTA is minimax optimal in terms of sample and model complexity. Numerical experiments further demonstrate that our method outperforms existing baseline approaches.
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- North America > United States > Illinois > Cook County > Evanston (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior
Lee, Seunggwan, Jung, Hwanhee, Koh, Byoungsoo, Huang, Qixing, Yoon, Sangho, Kim, Sangpil
A fundamental challenge in conditional 3D shape generation is to minimize the information loss and maximize the intention of user input. Existing approaches have predominantly focused on two types of isolated conditional signals, i.e., user sketches and text descriptions, each of which does not offer flexible control of the generated shape. In this paper, we introduce PASTA, the flexible approach that seamlessly integrates a user sketch and a text description for 3D shape generation. The key idea is to use text embeddings from a vision-language model to enrich the semantic representation of sketches. Specifically, these text-derived priors specify the part components of the object, compensating for missing visual cues from ambiguous sketches. In addition, we introduce ISG-Net which employs two types of graph convolutional networks: IndivGCN, which processes fine-grained details, and PartGCN, which aggregates these details into parts and refines the structure of objects. Extensive experiments demonstrate that PASTA outperforms existing methods in part-level editing and achieves state-of-the-art results in sketch-to-3D shape generation.
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > Texas > Schleicher County (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Is Multiple Object Tracking a Matter of Specialization?
Mancusi, Gianluca, Bernardi, Mattia, Panariello, Aniello, Porrello, Angelo, Cucchiara, Rita, Calderara, Simone
End-to-end transformer-based trackers have achieved remarkable performance on most human-related datasets. However, training these trackers in heterogeneous scenarios poses significant challenges, including negative interference - where the model learns conflicting scene-specific parameters - and limited domain generalization, which often necessitates expensive fine-tuning to adapt the models to new domains. In response to these challenges, we introduce Parameter-efficient Scenario-specific Tracking Architecture (PASTA), a novel framework that combines Parameter-Efficient Fine-Tuning (PEFT) and Modular Deep Learning (MDL). Specifically, we define key scenario attributes (e.g., camera-viewpoint, lighting condition) and train specialized PEFT modules for each attribute. These expert modules are combined in parameter space, enabling systematic generalization to new domains without increasing inference time. Extensive experiments on MOT-Synth, along with zero-shot evaluations on MOT17 and PersonPath22 demonstrate that a neural tracker built from carefully selected modules surpasses its monolithic counterpart. We release models and code.
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Vision (0.96)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
PASTA: Controllable Part-Aware Shape Generation with Autoregressive Transformers
Li, Songlin, Paschalidou, Despoina, Guibas, Leonidas
The increased demand for tools that automate the 3D content creation process led to tremendous progress in deep generative models that can generate diverse 3D objects of high fidelity. In this paper, we present PASTA, an autoregressive transformer architecture for generating high quality 3D shapes. PASTA comprises two main components: An autoregressive transformer that generates objects as a sequence of cuboidal primitives and a blending network, implemented with a transformer decoder that composes the sequences of cuboids and synthesizes high quality meshes for each object. Our model is trained in two stages: First we train our autoregressive generative model using only annotated cuboidal parts as supervision and next, we train our blending network using explicit 3D supervision, in the form of watertight meshes. Evaluations on various ShapeNet objects showcase the ability of our model to perform shape generation from diverse inputs \eg from scratch, from a partial object, from text and images, as well size-guided generation, by explicitly conditioning on a bounding box that defines the object's boundaries. Moreover, as our model considers the underlying part-based structure of a 3D object, we are able to select a specific part and produce shapes with meaningful variations of this part. As evidenced by our experiments, our model generates 3D shapes that are both more realistic and diverse than existing part-based and non part-based methods, while at the same time is simpler to implement and train.
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- North America > United States > California > Santa Clara County > Stanford (0.04)
- (5 more...)
Fuelling the Tour de France: Secrets of the team kitchens
Not so long ago, the professional cycling world's approach to fuelling was remarkably basic. Options for riders barely extended beyond a monotonous menu of pasta, rice or whatever fare that night's hotel kitchen decided to serve up. These days, it is an entirely different prospect, with vast sums spent on custom-built food trucks, personalised nutrition apps and meticulously-planned meal regimes all in the name of performance enhancement. For the nutritionists and chefs tasked with providing sustenance to power their team's riders over 2,170 miles in the coming weeks there are principally two dilemmas: what food to prepare and how to do so in an ever-changing environment. The answers are gleaned from a year-round process that begins in December during pre-season training.
- Europe > France (0.40)
- Asia > Middle East > UAE (0.05)
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
Basavatia, Shreyas, Murugesan, Keerthiram, Ratnakar, Shivam
Interactive fiction games have emerged as an important application to improve the generalization capabilities of language-based reinforcement learning (RL) agents. Existing environments for interactive fiction games are domain-specific or time-consuming to generate and do not train the RL agents to master a specific set of skills. In this work, we introduce an interactive environment for self-supervised RL, STARLING, for text-based games that bootstraps the text-based RL agents with automatically generated games (based on the seed set of game ideas) to boost the performance and generalization capabilities to reach a goal of the target environment. These games let the agent hone their skills on a predefined set of tasks. We create and test an environment with 100 games, generated using this automated framework that uses large language models (GPT-3) and an interactive fiction game engine (based on Inform7) to provide the user with the ability to generate more games under minimal human supervision. Experimental results based on both the human participants and baseline text-based RL agents reveal that current state-of-the-art text-based RL agents cannot use previously learned skills in new situations at the level humans can. These results enforce STARLING's potential to serve as a sandbox environment for further research in self-supervised text-based RL.
- North America > United States (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- Education (1.00)
- Leisure & Entertainment > Games > Computer Games (0.88)
AI is changing the world – and I've just eaten the underwhelming pasta that proves it Zing Tsjeng
It's been a drama-filled week for OpenAI, the creator of ChatGPT. Its wunderkind CEO Sam Altman has been unceremoniously booted out by its board and more than 600 staff members are now threatening to quit unless he's allowed back in. As a writer, I am of course duty-bound to swear on my copy of McNae's Essential Law for Journalists that I did not use OpenAI's chatbot to write this column – or did I? Even if I did, why would I fess up to it? Thanks to disastrously unpopular attempts by the likes of BuzzFeed to create AI-assisted content, its name is mud in the media industry.
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.47)
Visual AI and Linguistic Intelligence Through Steerability and Composability
Noever, David, Noever, Samantha Elizabeth Miller
This study explores the capabilities of multimodal large language models (LLMs) in handling challenging multistep tasks that integrate language and vision, focusing on model steerability, composability, and the application of long-term memory and context understanding. The problem addressed is the LLM's ability (Nov 2023 GPT-4 Vision Preview) to manage tasks that require synthesizing visual and textual information, especially where stepwise instructions and sequential logic are paramount. The research presents a series of 14 creatively and constructively diverse tasks, ranging from AI Lego Designing to AI Satellite Image Analysis, designed to test the limits of current LLMs in contexts that previously proved difficult without extensive memory and contextual understanding. Key findings from evaluating 800 guided dialogs include notable disparities in task completion difficulty. For instance, 'Image to Ingredient AI Bartender' (Low difficulty) contrasted sharply with 'AI Game Self-Player' (High difficulty), highlighting the LLM's varying proficiency in processing complex visual data and generating coherent instructions. Tasks such as 'AI Genetic Programmer' and 'AI Negotiator' showed high completion difficulty, emphasizing challenges in maintaining context over multiple steps. The results underscore the importance of developing LLMs that combine long-term memory and contextual awareness to mimic human-like thought processes in complex problem-solving scenarios.
- North America > United States > Tennessee (0.04)
- Europe > Czechia > Prague (0.04)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- (4 more...)
- Research Report (1.00)
- Instructional Material (1.00)
- Transportation (1.00)
- Leisure & Entertainment > Games (1.00)
- Information Technology > Security & Privacy (1.00)
- (4 more...)
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Zhang, Qingru, Singh, Chandan, Liu, Liyuan, Liu, Xiaodong, Yu, Bin, Gao, Jianfeng, Zhao, Tuo
In human-written articles, we often leverage the subtleties of text style, such as bold and italics, to guide the attention of readers. These textual emphases are vital for the readers to grasp the conveyed information. When interacting with large language models (LLMs), we have a similar need - steering the model to pay closer attention to user-specified information, e.g., an instruction. Existing methods, however, are constrained to process plain text and do not support such a mechanism. This motivates us to introduce PASTA - Post-hoc Attention STeering Approach, a method that allows LLMs to read text with user-specified emphasis marks. To this end, PASTA identifies a small subset of attention heads and applies precise attention reweighting on them, directing the model attention to user-specified parts. Like prompting, PASTA is applied at inference time and does not require changing any model parameters. Experiments demonstrate that PASTA can substantially enhance an LLM's ability to follow user instructions or integrate new knowledge from user inputs, leading to a significant performance improvement on a variety of tasks, e.g., an average accuracy improvement of 22% for LLAMA-7B. Our code is publicly available at https://github.com/QingruZhang/PASTA .
- North America > United States > Pennsylvania (0.04)
- North America > United States > New York (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- (2 more...)
- Leisure & Entertainment (0.46)
- Media > Photography (0.46)
Revealed: The best pasta shape for holding sauce - so, how does your favourite stack up?
With its simple mix of ingredients and high nutritional value, it's no surprise pasta is one of the most popular foods in the world. Despite dating back thousands of years, the age-old question still remains – which pasta shape is the best for holding sauce? To mark World Pasta Day, MailOnline turned to online AI tool ChatGPT for the answer, and it came up with some controversial results. Top of the list was cascatelli, a relatively new pasta from America with a curved shape and distinctive ruffles, deliberately designed to carry sauce. Also in the top six were spaghetti, penne and the'bow tie' pasta farfalle – but an expert claims a lot depends on the type of sauce too.
- Europe > United Kingdom (0.16)
- Europe > Italy (0.07)
- Oceania > Australia (0.05)
- (3 more...)