AITopics | plank

plank

Information about AI from the News, Publications, and Conferences

6b8dfb8c0c12e6fafc6c256cb08a5ca7-Paper-Conference.pdf

Neural Information Processing Systems, Feb-13-2026, 13:55:30 GMT

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > China > Beijing > Beijing (0.04)
(2 more...)

Genre: Workflow (0.51)

Industry:

Leisure & Entertainment > Games (0.72)
Materials > Metals & Mining > Iron (0.31)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments

Lu, Haoye, Seshadri, Pavan, Suleman, Kaheer

arXiv.org Artificial Intelligence, Dec-11-2025

Long-term planning in complex, text-based environments presents significant challenges due to open-ended action spaces, ambiguous observations, and sparse feedback. Recent research suggests that large language models (LLMs) encode rich semantic knowledge about the world, which can be valuable for guiding agents in high-level reasoning and planning across both embodied and purely textual settings. However, existing approaches often depend heavily on querying LLMs during training and inference, making them computationally expensive and difficult to deploy efficiently. In addition, these methods typically employ a pretrained, unaltered LLM whose parameters remain fixed throughout training, providing no opportunity for adaptation to the target task. To address these limitations, we introduce SCOPE (Subgoal-COnditioned Pretraining for Efficient planning), a one-shot hierarchical planner that leverages LLM-generated subgoals only at initialization to pretrain a lightweight student model. Unlike prior approaches that distill LLM knowledge by repeatedly prompting the model to adaptively generate subgoals during training, our method derives subgoals directly from example trajectories. This design removes the need for repeated LLM queries, significantly improving efficiency, though at the cost of reduced explainability and potentially suboptimal subgoals. Despite their suboptimality, our results on the TextCraft environment show that LLM-generated subgoals can still serve as a strong starting point for hierarchical goal decomposition in text-based planning tasks. Compared to the LLM-based hierarchical agent ADaPT (Prasad et al., 2024), which achieves a 0.52 success rate, our method reaches 0.56 and reduces inference time from 164.4 seconds to just 3.0 seconds.

large language model, machine learning, plank, (20 more...)

arXiv.org Artificial Intelligence

2512.09897

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Monaco (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment (0.48)
Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

If you could upload your mind to a virtual utopia, would you?

New Scientist, Oct-31-2025, 09:30:31 GMT

In, the characters face an impossible choice: upload your mind into a virtual utopia, or crumble away in the abandoned physical world. Mind-uploading is familiar to us as a science fiction trope, often anchoring relationship dramas and philosophical inquiry. But what does it really mean to upload your consciousness into intangible space? Can the mechanics be extrapolated from our present-day science?

artificial intelligence, social media, upload, (14 more...)

New Scientist

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

6b8dfb8c0c12e6fafc6c256cb08a5ca7-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 20:44:47 GMT

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > China > Beijing > Beijing (0.04)
(2 more...)

Industry:

Materials > Metals & Mining (0.96)
Leisure & Entertainment > Sports > Golf (0.95)
Leisure & Entertainment > Games (0.72)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
(2 more...)

LogicGuard: Improving Embodied LLM agents through Temporal Logic based Critics

Gokhale, Anand, Srivastava, Vaibhav, Bullo, Francesco

arXiv.org Artificial Intelligence, Sep-24-2025

Large language models (LLMs) have shown promise in zero-shot and single-step reasoning and decision-making problems, but in long-horizon sequential planning tasks, their errors compound, often leading to unreliable or inefficient behavior. We introduce LogicGuard, a modular actor-critic architecture in which an LLM actor is guided by a trajectory-level LLM critic that communicates through Linear Temporal Logic (LTL). Our setup combines the reasoning strengths of language models with the guarantees of formal logic. The actor selects high-level actions from natural language observations, while the critic analyzes full trajectories and proposes new LTL constraints that shield the actor from future unsafe or inefficient behavior. LogicGuard supports both fixed safety rules and adaptive, learned constraints, and is model-agnostic: any LLM-based planner can serve as the actor, with LogicGuard acting as a logic-generating wrapper. We formalize planning as graph traversal under symbolic constraints, allowing LogicGuard to analyze failed or suboptimal trajectories and generate new temporal logic rules that improve future behavior. To demonstrate generality, we evaluate LogicGuard across two distinct settings: short-horizon general tasks and long-horizon specialist tasks. On the Behavior benchmark of 100 household tasks, LogicGuard increases task completion rates by 25% over a baseline InnerMonologue planner. On the Minecraft diamond-mining task, which is long-horizon and requires multiple interdependent subgoals, LogicGuard improves both efficiency and safety compared to SayCan and InnerMonologue. These results show that enabling LLMs to supervise each other through temporal logic yields more reliable, efficient, and safe decision-making for embodied agents.
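
The shielding idea in this abstract can be illustrated with a minimal sketch. It is not the LogicGuard implementation: the two constraint classes, the `shield` function, and the Minecraft-style action names are all hypothetical, and the constraints are hand-written rather than proposed by an LLM critic. It only shows how two common temporal patterns, "never do X" (G ¬X) and "do not do B until A has happened" (¬B W A, a weak-until precedence rule), could filter an actor's candidate actions given the trajectory so far.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Never:
    """G(!action): the action must never be taken."""
    action: str

    def allows(self, history: Sequence[str], candidate: str) -> bool:
        return candidate != self.action


@dataclass(frozen=True)
class Before:
    """(!later) W first: `later` is blocked until `first` has occurred."""
    first: str
    later: str

    def allows(self, history: Sequence[str], candidate: str) -> bool:
        return candidate != self.later or self.first in history


def shield(history: Sequence[str], candidates: Sequence[str], constraints) -> List[str]:
    """Keep only the candidate actions that every constraint permits."""
    return [a for a in candidates if all(c.allows(history, a) for c in constraints)]


# Hypothetical rules and actions, chosen only for illustration.
rules = [Never("dig_straight_down"), Before("craft_iron_pickaxe", "mine_diamond")]
options = ["dig_straight_down", "mine_diamond", "mine_iron"]

print(shield([], options, rules))                      # only mine_iron survives
print(shield(["craft_iron_pickaxe"], options, rules))  # mine_diamond is now allowed
```

A full system would need an LTL parser and a monitor over arbitrary formulas; the two hard-coded patterns here stand in for that machinery.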

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.03293

Country:

North America > Mexico > Gulf of Mexico (0.28)
North America > United States > Michigan (0.04)
North America > United States > California (0.04)
(3 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Law (0.93)
Materials > Metals & Mining > Diamonds (0.66)
Materials > Metals & Mining > Iron (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

NLPnorth @ TalentCLEF 2025: Comparing Discriminative, Contrastive, and Prompt-Based Methods for Job Title and Skill Matching

Zhang, Mike, van der Goot, Rob

arXiv.org Artificial Intelligence, Jun-25-2025

Matching job titles is a highly relevant task in the computational job market domain, as it improves, for example, automatic candidate matching, career path prediction, and job market analysis. Furthermore, aligning job titles to job skills can be considered an extension of this task, with similar relevance for the same downstream tasks. In this report, we outline NLPnorth's submission to TalentCLEF 2025, which includes both of these tasks: Multilingual Job Title Matching, and Job Title-Based Skill Prediction. For both tasks we compare (fine-tuned) classification-based, (fine-tuned) contrastive-based, and prompting methods. We observe that for Task A, our prompting approach performs best, with a mean average precision (MAP) of 0.492 on test data, averaged over English, Spanish, and German. For Task B, we obtain a MAP of 0.290 on test data with our fine-tuned classification-based approach. Additionally, we made use of extra data by pulling all the language-specific titles and corresponding descriptions from ESCO for each job and skill. Overall, we find that the largest multilingual language models perform best for both tasks. Per the provisional results and only counting the unique teams, the ranking on Task A is 5th/20 and for Task B 3rd/14.
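
For readers unfamiliar with the metric reported above, the following sketch computes mean average precision using the standard textbook definition. It is an assumption that this matches the shared task's official scorer, which may apply cutoffs or other conventions; the function names and the toy job titles are hypothetical.

```python
from typing import Sequence, Set


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average of precision@k taken at each rank k where a relevant item appears."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / rank
    # Dividing by len(relevant) penalizes relevant items that were never retrieved.
    return precision_sum / len(relevant)


def mean_average_precision(rankings: Sequence[Sequence[str]],
                           relevants: Sequence[Set[str]]) -> float:
    """Mean of per-query average precision; 0.0 when there are no queries."""
    scores = [average_precision(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(scores) / len(scores) if scores else 0.0


# Two toy queries: each has a ranked list of candidate titles and a gold set.
rankings = [["data engineer", "chef", "ml engineer"], ["nurse", "carpenter"]]
relevants = [{"data engineer", "ml engineer"}, {"carpenter"}]
print(round(mean_average_precision(rankings, relevants), 4))
```

In the first query the relevant titles sit at ranks 1 and 3, giving (1/1 + 2/3) / 2; in the second the only relevant title is at rank 2, giving 1/2. The mean of those two values is the MAP.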

computational linguistic, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2506.19058

Country:

North America > United States > California > San Francisco County > San Francisco (0.28)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Middle East > Malta > Eastern Region > Northern Harbour District > St. Julian's (0.05)
(9 more...)

Genre: Research Report (0.66)

Industry: Marketing (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)

BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks

Du, Weihong, Liao, Wenrui, Yan, Binyu, Liang, Hongru, Cohn, Anthony G., Lei, Wenqiang

arXiv.org Artificial Intelligence, Jun-2-2025

Large language model (LLM) based agents have shown great potential in following human instructions and automatically completing various tasks. To complete a task, the agent needs to decompose it into easily executed steps by planning. Existing studies mainly plan by inferring which steps should be executed next, starting from the agent's initial state. However, this forward reasoning paradigm does not work well for complex tasks. We propose to study this issue in Minecraft, a virtual environment that simulates complex tasks based on real-world scenarios. We believe that the failure of forward reasoning is caused by the large perception gap between the agent's initial state and the task goal. To this end, we leverage backward reasoning and start planning from the terminal state, from which the task goal can be achieved directly in one step. Specifically, we design a BAckward Reasoning based agent (BAR). It is equipped with a recursive goal decomposition module, a state consistency maintaining module, and a stage memory module to make robust, consistent, and efficient planning starting from the terminal state. Experimental results demonstrate the superiority of BAR over existing methods and the effectiveness of the proposed modules.
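
The contrast between forward and backward planning can be made concrete with a small sketch of recursive goal decomposition. This is not BAR itself: there is no LLM, and the state consistency and stage memory modules are not modelled. The `RECIPES` table, the item names, and `backward_plan` are hypothetical, and the sketch assumes the prerequisite graph has no cycles.

```python
from typing import Dict, List, Optional, Set

# Hypothetical prerequisite table: item -> items needed to obtain it.
# Items without an entry are treated as directly obtainable from the world.
RECIPES: Dict[str, List[str]] = {
    "wooden_pickaxe": ["planks", "stick"],
    "stick": ["planks"],
    "planks": ["log"],
}


def backward_plan(goal: str,
                  recipes: Dict[str, List[str]],
                  inventory: Set[str],
                  plan: Optional[List[str]] = None) -> List[str]:
    """Start from the goal, recurse into its prerequisites, and emit steps
    in an order where every step's prerequisites come before it."""
    if plan is None:
        plan = []
    if goal in inventory or goal in plan:
        return plan  # already owned or already scheduled
    for prerequisite in recipes.get(goal, []):
        backward_plan(prerequisite, recipes, inventory, plan)
    plan.append(goal)
    return plan


print(backward_plan("wooden_pickaxe", RECIPES, inventory=set()))
print(backward_plan("wooden_pickaxe", RECIPES, inventory={"planks"}))
```

Starting from the goal means the planner only ever expands items that the goal actually depends on, and anything already in the inventory prunes its whole subtree.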

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2505.14079

Country:

Europe > United Kingdom > England > West Yorkshire > Leeds (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
Asia > China > Sichuan Province (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Leisure & Entertainment > Games > Computer Games (0.74)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Stow: Robotic Packing of Items into Fabric Pods

Hudson, Nicolas, Hooks, Josh, Warrier, Rahul, Salisbury, Curt, Hartley, Ross, Kumar, Kislay, Chandrashekhar, Bhavana, Birkmeyer, Paul, Tang, Bosch, Frost, Matt, Thakar, Shantanu, Piaskowy, Tony, Nilsson, Petter, Petersen, Josh, Doshi, Neel, Slatter, Alan, Bhatia, Ankit, Meeker, Cassie, Xue, Yuechuan, Cox, Dylan, Kyriazis, Alex, Lou, Bai, Hasan, Nadeem, Rana, Asif, Chacko, Nikhil, Xu, Ruinian, Faal, Siamak, Seraj, Esi, Agrawal, Mudit, Jamieson, Kevin, Bisagni, Alessio, Samzun, Valerie, Fuller, Christine, Keklak, Alex, Frenkel, Alex, Ratliff, Lillian, Parness, Aaron

arXiv.org Artificial Intelligence, May-8-2025

This paper presents a compliant manipulation system capable of placing items onto densely packed shelves. The wide diversity of items and strict business requirements for high production rates and low defect rates have so far kept warehouse robots from performing this task. Our innovations in hardware, perception, decision-making, motion planning, and control have enabled this system to perform over 500,000 stows in a large e-commerce fulfillment center. The system achieves human levels of packing density and speed while prioritizing work on overhead shelves to enhance the safety of humans working alongside the robots.

artificial intelligence, bin, manipulation, (18 more...)

arXiv.org Artificial Intelligence

2505.04572

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.82)

Industry:

Information Technology > Services > e-Commerce Services (0.54)
Transportation > Freight & Logistics Services (0.54)

Technology: Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.67)

Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning

White, Isadora, Nottingham, Kolby, Maniar, Ayush, Robinson, Max, Lillemark, Hansen, Maheshwari, Mehul, Qin, Lianhui, Ammanabrolu, Prithviraj

arXiv.org Artificial Intelligence, Apr-28-2025

Collaboration is ubiquitous and essential in day-to-day life -- from exchanging ideas, to delegating tasks, to generating plans together. This work studies how LLMs can adaptively collaborate to perform complex embodied reasoning tasks. To this end, we introduce MINDcraft, an easily extensible platform built to enable LLM agents to control characters in the open-world game of Minecraft, and MineCollab, a benchmark to test the different dimensions of embodied and collaborative reasoning. An experimental study finds that the primary bottleneck in collaborating effectively for current state-of-the-art agents is efficient natural language communication, with agent performance dropping by as much as 15% when they are required to communicate detailed task completion plans. We conclude that existing LLM agents are ill-optimized for multi-agent collaboration, especially in embodied scenarios, and highlight the need to employ methods beyond in-context and imitation learning. Our website can be found here: https://mindcraft-minecollab.github.io/

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2504.1795

Country:

North America > United States > California > San Diego County > San Diego (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)
North America > Dominican Republic (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)

Industry: Leisure & Entertainment > Games > Computer Games (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

Li, Zaijing, Xie, Yuquan, Shao, Rui, Chen, Gongwei, Jiang, Dongmei, Nie, Liqiang

arXiv.org Artificial Intelligence, Mar-11-2025

Building an agent that can mimic human behavior patterns to accomplish various open-world tasks is a long-term goal. To enable agents to effectively learn behavioral patterns across diverse tasks, a key challenge lies in modeling the intricate relationships among observations, actions, and language. To this end, we propose Optimus-2, a novel Minecraft agent that incorporates a Multimodal Large Language Model (MLLM) for high-level planning, alongside a Goal-Observation-Action Conditioned Policy (GOAP) for low-level control. GOAP contains (1) an Action-guided Behavior Encoder that models causal relationships between observations and actions at each timestep, then dynamically interacts with the historical observation-action sequence, consolidating it into fixed-length behavior tokens, and (2) an MLLM that aligns behavior tokens with open-ended language instructions to predict actions auto-regressively. Moreover, we introduce a high-quality Minecraft Goal-Observation-Action (MGOA) dataset, which contains 25,000 videos across 8 atomic tasks, providing about 30M goal-observation-action pairs. The automated construction method, along with the MGOA dataset, can contribute to the community's efforts to train Minecraft agents. Extensive experimental results demonstrate that Optimus-2 exhibits superior performance across atomic tasks, long-horizon tasks, and open-ended instruction tasks in Minecraft. Please see the project page at https://cybertronagent.github.io/Optimus-2.github.io/.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2502.19902

Country:

Asia > China > Heilongjiang Province > Harbin (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Materials > Metals & Mining (1.00)
Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)
