AITopics | Food & Agriculture

Collaborating Authors

Food & Agriculture

MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations

Neural Information Processing SystemsJun-11-2026, 02:00:26 GMT

We introduce MIRAGE, a new benchmark for multimodal expert-level reasoning and decision-making in consultative interaction settings. Designed for the domain of agriculture, MIRAGE captures the full complexity of expert consultations by combining natural user queries, expert-authored responses, and image-based context, offering a high-fidelity benchmark for evaluating models on grounded reasoning, clarification strategies, and long-form generation in a real-world, knowledge-intensive domain. Grounded in over 35,000 real user-expert interactions, and curated through a carefully designed multi-step pipeline, MIRAGE spans diverse crop health, pest diagnosis, and crop management scenarios. The benchmark includes more than 7,000 unique biological entities, covering plant species, pests, and diseases, making it one of the most taxonomically diverse benchmarks available for vision-language models in real-world expert-guided domains. Unlike existing benchmarks that rely on well-specified user inputs, MIRAGE features underspecified, context-rich scenarios, requiring models to infer latent knowledge gaps and either proactively guide the interaction or respond. Our benchmark comprises two core components. The Single-turn Challenge to reason over a single user turn and image set, identify relevant entities, infer causal explanations, and generate actionable recommendations; and a Multi-Turn challenge for dialogue state tracking, goal-driven generation, and expert-level conversational decision-making. We evaluate more than 20 closed and open-source frontier vision-language models (VLMs), using three reasoning language models as evaluators, highlighting the significant challenges posed by MIRAGE in both single-turn and multi-turn interaction settings. Even the advanced GPT4.1 and GPT4o models achieve 44.6% and 40.9% accuracy, respectively, indicating significant room for improvement.

large language model, machine learning, natural language, (11 more...)

Neural Information Processing Systems

Industry: Food & Agriculture > Agriculture (0.95)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

4 lawn options for people who hate mowing

Grass alternatives can bring beauty (and bees) to your yard. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. If you dislike mowing the lawn, you have other options. Breakthroughs, discoveries, and DIY tips sent six days a week. By signing up, you confirm you are 16+, will receive newsletters and promotional content and agree to our Terms of Use and acknowledge the data practices in our Privacy Policy .

artificial intelligence, homeowner, no-mow lawn, (11 more...)

Popular Science

Industry:

Food & Agriculture > Agriculture (0.50)
Information Technology > Security & Privacy (0.36)

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

Handle with care: Soft robot gripper picks ripe fruit without bruising

RobohubMay-27-2026, 15:44:38 GMT

When assessing the ripeness of fruit, sight and smell can tell you a lot, but the best indicator is often how the fruit feels. Cornell researchers used stretchable fiber-optic sensors to create a soft robot gripper that can predict the ripeness of strawberries by touch, then gently twist them off their branch or vine without causing any damage. The technology, developed in the lab of Rob Shepherd, the John F. Carr Professor of Mechanical Engineering in the Cornell Duffield College of Engineering, could lead to more resilient and ecological food production and increase the availability of fruit species that are difficult to cultivate. Shepherd's Organic Robotics Lab previously demonstrated the potential of stretchable fiber-optic sensors to give soft robotic systems the ability to feel the same dynamic, tactile sensations that enable humans to navigate the natural world. In recent years, the team has expanded into agriculture, designing a soft robotic gripper that injects living plant leaves with sensors that help it detect and communicate with its environment.

artificial intelligence, gripper, shepherd, (14 more...)

Robohub

Country: Europe (0.15)

Industry: Food & Agriculture > Agriculture (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.35)

Add feedback

How to remove bamboo from your yard

More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. If bamboo appears unexpectedly in your yard, don't panic. Breakthroughs, discoveries, and DIY tips sent six days a week. Bamboo may feel like an easy landscaping win because it's a fast-growing privacy screen that can turn a plain yard into a lush retreat. But then a few shoots start popping up in random places all over your yard.

artificial intelligence, bamboo, physics popular science video space, (9 more...)

Popular Science

Country: North America > United States (0.15)

Industry:

Energy (0.70)
Food & Agriculture > Agriculture (0.52)
Materials > Chemicals (0.32)

Technology: Information Technology > Artificial Intelligence (0.36)

Add feedback

Why sloths risk their lives to poop

Every week, sloths climb down to do their business on the forest floor--where predators lie in wait. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Sloths can live up to 30 years in the wild. Breakthroughs, discoveries, and DIY tips sent six days a week. Every week, without fail, the three-toed sloth takes a breathtaking, almost suicidal risk--all for the sake of a bowel movement.

algae, artificial intelligence, sloth, (11 more...)

Popular Science

Country: North America > United States > Wisconsin (0.15)

Industry:

Health & Medicine (0.51)
Food & Agriculture > Agriculture (0.50)
Media (0.48)

Technology: Information Technology > Artificial Intelligence (0.50)

Add feedback

TabPFN-3: Technical Report

Grinsztajn, Léo, Flöge, Klemens, Key, Oscar, Birkel, Felix, Jund, Philipp, Roof, Brendan, Manium, Mihir, Bin, Shi, Hoo, null, Bühler, Magnus, Garg, Anurag, Safaric, Dominik, Robertson, Jake, Jäger, Benjamin, Alessi, Simone, Hayler, Adrian, Moroshan, Vladyslav, Purucker, Lennart, Singer, Philipp, Arazi, Alan, Siems, Julien, Metzen, Jan Hendrik, Grab, Georg, Erickson, Nick, Guo, Siyuan, Kalfon, Eliott, Bing, Simon, Salinas, David, Cornu, Clara, Wehrhahn, Lilly Charlotte, Kriuchkova, Diana, Kaya, Kursat, Sidhoum, Lydia, Salmon, Marie, Chen, Jerry, Hulsebos, Madelon, LeCun, Yann, Müller, Samuel, Schölkopf, Bernhard, Gambhir, Sauraj, Hollmann, Noah, Hutter, Frank

arXiv.org Machine LearningMay-15-2026

Tabular data underpins most high-value prediction problems in science and industry, and TabPFN has driven the foundation model revolution for this modality. Designed with feedback from our users, TabPFN-3 builds on this foundation to scale state-of-the-art performance to datasets with 1M training rows and substantially reduce training and inference time. Pretrained exclusively on synthetic data from our prior, TabPFN-3 dramatically pushes the frontier of tabular prediction and brings substantial gains on time series, relational, and tabular-text data. On the standard tabular benchmark TabArena, a forward pass of TabPFN-3 outperforms all other models, including tuned and ensembled baselines, by a significant margin, and pareto-dominates the speed/performance frontier. On more diverse datasets, TabPFN-3 ranks first on datasets with many classes, and beats 8-hour-tuned gradient-boosted-tree baselines on datasets up to 1M training rows and 200 features. TabPFN-3 introduces test-time compute scaling to tabular foundation models. Our API offering TabPFN-3-Plus (Thinking) exploits this to beat all non-TabPFN models by over 200 Elo on TabArena, rising to 420 Elo on the largest data subset, and outperforms AutoGluon 1.5 extreme while being 10x faster, without using LLMs, real data, internet search or any other model besides TabPFN. TabPFN-3 extends the capabilities of our models, enabling SOTA prediction on relational data (new SOTA foundation model on RelBenchV1) and tabular-text data (SOTA on TabSTAR via TabPFN-3-Plus); and improves existing integrations: a specialized checkpoint, TabPFN-TS-3, ranks 2nd on the time-series benchmark fev-bench, and SHAP-value computation is up to 120x faster. TabPFN-3 achieves this performance while being up to 20x faster than TabPFN-2.5. In addition, a reduced KV cache and row-chunking scale to 1M rows on one H100 with fast inference speed.

arXiv.org Machine Learning

2605.13986

Country:

Asia (1.00)
North America > United States (0.67)

Genre: Research Report > Experimental Study (1.00)

Industry:

Materials (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
(21 more...)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
(6 more...)

Add feedback

TinyBayes: Closed-Form Bayesian Inference via Jacobi Prior for Real-Time Image Classification on Edge Devices

Sardar, Shouvik, Das, Sourish

arXiv.org Machine LearningMay-8-2026

Cocoa (Theobroma cacao) is a critical cash crop for millions of smallholder farmers in West Africa, where Cocoa Swollen Shoot Virus Disease (CSSVD) and anthracnose cause devastating yield losses. Automated disease detection from leaf images is essential for early intervention, yet deploying such systems in resource-constrained settings demands models that are small, fast, and require no internet connectivity. Existing edge-deployable plant disease systems rely on end-to-end deep learning without uncertainty quantification, while Bayesian methods for edge devices focus on hardware-level inference architectures rather than agricultural applications. We bridge this gap with TinyBayes, the first framework to combine a closed-form Bayesian classifier with a mobile-grade computer vision pipeline for crop disease detection. Our pipeline uses YOLOv8-Nano (5.9 MB) for lesion localisation, MobileNetV3-Small (3.5 MB) for feature extraction, and the Jacobi prior; a Bayesian method that provides a closed form non-iterative estimators via projection, for the classification. The Jacobi-DMR (Distributed Multinomial Regression) classifier adds only 13.5 KB to the pipeline, bringing the total model size within 9.5 MB, while achieving 78.7% accuracy on the Amini Cocoa Contamination Challenge dataset and enabling end-to-end CPU inference under 150 ms per image. We benchmark against seven classifiers including Random Forest, SVM, Ridge, Lasso, Elastic Net, XGBoost, and Jacobi-GP, and demonstrate that the Jacobi-DMR offers the best trade-off between accuracy, model size, and inference speed for edge deployment. We have proved the asymptotic equivalence and consistency, asymptotic normality and the bias correction of Jacobi-DMR. All data and codes are available here: https://github.com/shouvik-sardar/TinyBayes

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Machine Learning

2605.06333

Country: Africa > West Africa (0.24)

Genre: Research Report (0.82)

Industry:

Health & Medicine (0.47)
Food & Agriculture > Agriculture (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.75)

Add feedback

Why is everyone talking about watermelon buttholes?

Popular ScienceMay-2-2026, 12:00:00 GMT

Environment Agriculture Why is everyone talking about watermelon buttholes? A watermelon expert offers tips on picking the tastiest summer fruit (no buttholes required). More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Skip the blossom end and look for a yellow spot. Breakthroughs, discoveries, and DIY tips sent six days a week.

artificial intelligence, physics popular science video space, watermelon, (10 more...)

Popular Science

Industry: Food & Agriculture > Agriculture (0.31)

Technology:

Information Technology > Artificial Intelligence (0.37)
Information Technology > Communications (0.33)

Add feedback

Image Enabling AI for Biodiversity

Neural Information Processing SystemsApr-30-2026, 01:34:43 GMT

We introduce BioTrove, the largest publicly accessible dataset designed to advance AI applications in biodiversity. Curated from the iNaturalist platform and vetted to include only research-grade data, BioTrove contains 161.9 million images, offering unprecedented scale and diversity from three primary kingdoms: Animalia ("animals"), Fungi ("fungi"), and Plantae ("plants"), spanning approximately 366.6K species. Each image is annotated with scientific names, taxonomic hierarchies, and common names, providing rich metadata to support accurate AI model development across diverse species and ecosystems. We demonstrate the value of BioTrove by releasing a suite of CLIP models trained using a subset of 40 million captioned images, known as BioTrove-Train. This subset focuses on seven categories within the dataset that are underrepresented in standard image recognition models, selected for their critical role in biodiversity and agriculture: Aves ("birds"), Arachnida ("spiders/ticks/mites"), Insecta ("insects"), Plantae ("plants"), Fungi ("fungi"), Mollusca ("snails"), and Reptilia ("snakes/lizards"). To support rigorous assessment, we introduce several new benchmarks and report model accuracy for zero-shot learning across life stages, rare species, confounding species, and multiple taxonomic levels. We anticipate that BioTrove will spur the development of AI models capable of supporting digital tools for pest control, crop monitoring, biodiversity assessment, and environmental conservation. These advancements are crucial for ensuring food security, preserving ecosystems, and mitigating the impacts of climate change. BioTrove is publicly available, easily accessible, and ready for immediate use.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country: North America > United States > Arizona (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Food & Agriculture > Agriculture (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement

Neural Information Processing SystemsApr-24-2026, 18:28:11 GMT

Optimizing multiple competing black-box objectives is a challenging problem in many fields, including science, engineering, and machine learning. Multi-objective Bayesian optimization (MOBO) is a sample-efficient approach for identifying the optimal trade-offs between the objectives. However, many existing methods perform poorly when the observations are corrupted by noise. We propose a novel acquisition function, NEHVI, that overcomes this important practical limitation by applying a Bayesian treatment to the popular expected hypervolume improvement (EHVI) criterion and integrating over this uncertainty in the Pareto frontier. We argue that, even in the noiseless setting, generating multiple candidates in parallel is an incarnation of EHVI with uncertainty in the Pareto frontier and therefore can be addressed using the same underlying technique. Through this lens, we derive a natural parallel variant, qNEHVI, that reduces computational complexity of parallel EHVI from exponential to polynomial with respect to the batch size.

artificial intelligence, machine learning, optimization, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry:

Transportation (0.67)
Food & Agriculture > Agriculture (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback