Exploring Object Status Recognition for Recipe Progress Tracking in Non-Visual Cooking
Li, Franklin Mingzhe, Ng, Kaitlyn, Zhu, Bin, Carrington, Patrick
–arXiv.org Artificial Intelligence
Cooking plays a vital role in everyday independence and well-being, yet remains challenging for people with vision impairments due to limited support for tracking progress and receiving contextual feedback. Object status - the condition or transformation of ingredients and tools - offers a promising but underexplored foundation for context-aware cooking support. In this paper, we present OSCAR (Object Status Context Awareness for Recipes), a technical pipeline that explores the use of object status recognition to enable recipe progress tracking in non-visual cooking. OSCAR integrates recipe parsing, object status extraction, visual alignment with cooking steps, and time-causal modeling to support real-time step tracking. We evaluate OSCAR on 173 instructional videos and a real-world dataset of 12 non-visual cooking sessions recorded by BLV individuals in their homes. Our results show that object status consistently improves step prediction accuracy across vision-language models, and reveal key factors that impact performance in real-world conditions, such as implicit tasks, camera placement, and lighting. We contribute the pipeline of context-aware recipe progress tracking, an annotated real-world non-visual cooking dataset, and design insights to guide future context-aware assistive cooking systems.
arXiv.org Artificial Intelligence
Jul-8-2025
- Country:
- North America > United States
- New York (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.14)
- Florida > Hillsborough County
- University (0.40)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Singapore
- Central Region > Singapore (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Consumer Health (1.00)
- Media (0.93)
- Education > Educational Technology
- Audio & Video (0.34)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Machine Learning (1.00)
- Natural Language > Chatbot (0.46)
- Representation & Reasoning > Personal Assistant Systems (0.46)
- Information Technology > Artificial Intelligence