explorer
- Asia > China > Beijing > Beijing (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Sweden > Skåne County > Malmö (0.04)
- Asia > Middle East > Jordan (0.04)
Inside the world's longest underwater cave: Subterranean water 'web' in Mexico extends at least 325 MILES
Leaked recording reveals Campbell's exec's sickening remarks about iconic soup's ingredients How Lauren Sanchez would REALLY look if she'd never had rumored plastic surgery Trump's losing control... MAGA's imploding... and White House insiders tell me why they're REALLY worried: ANDREW NEIL Billionaire family posts VERY unusual obituary after heir, 40, met violent end at $2.8m hunting lodge following marriage scandal These women have lost as much as nine stone WITHOUT jabs: Now they reveal secret to their stunning success, the extraordinary event that brought them together and how it's changed their lives... Judge throws out Comey and James cases as Trump's beauty queen prosecutor is humiliated Her moving videos about the handsome boyfriend who ghosted her went viral and catapulted her to overnight fame. Kate Gosselin's ex Jon is seen at his splashy wedding for the first time as son Collin weighs in on his siblings not attending Fugitive'Slender Man' stabber Morgan Geyser snapped'just Google me' when asked for ID by cops who found her with MUCH older lover It all seems to be falling apart now! Pete Hegseth drops hammer on Democrat senator in'sedition' storm as court martial looms after Trump's execution threat Sabrina Carpenter looks unrecognisable in throwback snap from seven years ago as fans call her rebranding'wild' Neuralink's'Patient 4' feared missing months after getting revolutionary brain chip... now his wife tells the REAL heartbreaking story NFL's first transgender cheerleader makes explosive allegation against Carolina Panthers Slash your cholesterol by a third in just a month... hundreds of thousands are on a new diet that's transforming lives. Inside the world's longest underwater cave: Subterranean water'web' in Mexico extends at least 325 MILES Beneath the idyllic resort towns of Mexico's Yucatan Peninsula, daring explorers have uncovered a hidden world of grand chambers and twisting tunnels. The Ox Bel Ha, Mayan for'Three Paths of Water', is a sprawling water'web' that makes up the world's longest underwater cave system.
- North America > Mexico > Yucatán (0.25)
- North America > Canada > Alberta (0.14)
- North America > United States > Nevada > Clark County > Las Vegas (0.05)
- (18 more...)
- Media > Television (1.00)
- Media > Music (1.00)
- Media > Film (1.00)
- (3 more...)
Student Engagement in AI Assisted Complex Problem Solving: A Pilot Study of Human AI Rubik's Cube Collaboration
Vanacore, Kirk, Ocumpaugh, Jaclyn, Agostinelli, Forest, Wu, Dezhi, Vuruma, Sai, Irvin, Matt
Games and puzzles play important pedagogical roles in STEM learning. New AI algorithms that can solve complex problems offer opportunities for scaffolded instruction in puzzle solving. This paper presents the ALLURE system, which uses an AI algorithm (Deep CubeA) to guide students in solving a common first step of the Rubik's Cube (i.e., the white cross). Using data from a pilot study we present preliminary findings about students' behaviors in the system, how these behaviors are associated with STEM skills - including spatial reasoning, critical thinking and algorithmic thinking. We discuss how data from ALLURE can be used in future educational data mining to understand how students benefit from AI assistance and collaboration when solving complex problems.
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Europe > Austria > Vienna (0.14)
- North America > United States > South Carolina (0.05)
- (2 more...)
- Instructional Material (1.00)
- Research Report > Experimental Study (0.68)
- Research Report > New Finding (0.46)
- Education > Curriculum > Subject-Specific Education (1.00)
- Leisure & Entertainment > Games > Rubik's Cube (0.75)
WebThinker: Empowering Large Reasoning Models with Deep Research Capability
Li, Xiaoxi, Jin, Jiajie, Dong, Guanting, Qian, Hongjin, Wu, Yongkang, Wen, Ji-Rong, Zhu, Yutao, Dou, Zhicheng
Large reasoning models (LRMs), such as OpenAI-o1 and DeepSeek-R1, demonstrate impressive long-horizon reasoning capabilities. However, their reliance on static internal knowledge limits their performance on complex, knowledge-intensive tasks and hinders their ability to produce comprehensive research reports requiring synthesis of diverse web information. To address this, we propose WebThinker, a deep research agent that empowers LRMs to autonomously search the web, navigate among web pages, and draft reports during the reasoning process. WebThinker integrates a Deep Web Explorer module, enabling LRMs to dynamically search, navigate, and extract information from the web when encountering knowledge gaps. It also employs an Autonomous Think-Search-and-Draft strategy, allowing the model to seamlessly interleave reasoning, information gathering, and report writing in real time. To further enhance research tool utilization, we introduce an RL-based training strategy via iterative online Direct Preference Optimization (DPO). Extensive experiments on complex reasoning benchmarks (GPQA, GAIA, WebWalkerQA, HLE) and scientific report generation tasks (Glaive) demonstrate that WebThinker significantly outperforms existing methods and strong proprietary systems. Our approach enhances LRM reliability and applicability in complex scenarios, paving the way for more capable and versatile deep research systems. The code is available at https://github.com/RUC-NLPIR/WebThinker.
- Europe > Austria > Vienna (0.14)
- Asia > Southeast Asia (0.04)
- Asia > Singapore (0.04)
- (14 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Do LLMs Really Need 10+ Thoughts for "Find the Time 1000 Days Later"? Towards Structural Understanding of LLM Overthinking
Zhang, Xinliang Frederick, Mohananey, Anhad, Chronopoulou, Alexandra, Papalampidi, Pinelopi, Gupta, Somit, Munkhdalai, Tsendsuren, Wang, Lu, Upadhyay, Shyam
Models employing long chain-of-thought (CoT) reasoning have shown superior performance on complex reasoning tasks. Yet, this capability introduces a critical and often overlooked inefficiency -- overthinking -- models often engage in unnecessarily extensive reasoning even for simple queries, incurring significant computations without accuracy improvements. While prior work has explored solutions to mitigate overthinking, a fundamental gap remains in our understanding of its underlying causes. Most existing analyses are limited to superficial, profiling-based observations, failing to delve into LLMs' inner workings. This study introduces a systematic, fine-grained analyzer of LLMs' thought process to bridge the gap, TRACE. We first benchmark the overthinking issue, confirming that long-thinking models are five to twenty times slower on simple tasks with no substantial gains. We then use TRACE to first decompose the thought process into minimally complete sub-thoughts. Next, by inferring discourse relationships among sub-thoughts, we construct granular thought progression graphs and subsequently identify common thinking patterns for topically similar queries. Our analysis reveals two major patterns for open-weight thinking models -- Explorer and Late Landing. This finding provides evidence that over-verification and over-exploration are the primary drivers of overthinking in LLMs. Grounded in thought structures, we propose a utility-based definition of overthinking, which moves beyond length-based metrics. This revised definition offers a more insightful understanding of LLMs' thought progression, as well as practical guidelines for principled overthinking management.
- Europe > Austria > Vienna (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- (7 more...)
- Asia > China > Beijing > Beijing (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Sweden > Skåne County > Malmö (0.04)
- Asia > Middle East > Jordan (0.04)
How to Get Your Kids Into STEM Even When Its Future Is Uncertain
Thinking about science and technology in terms of return on investment misses the point. Here's what kids really need to know. That's what led me to become a professor. As a high school student, one of my major life goals was to figure out how to build an actual light sword. Doing so is all but impossible, so it didn't really matter if I went into engineering or science, but I pursued STEM just the same.
- North America > United States > Louisiana (0.05)
- North America > United States > California (0.05)
- Europe > Switzerland (0.05)
- (4 more...)
A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Wang, Ziqi, Niu, Boye, Li, Zhongli, Meng, Linghui, Liu, Jing, Zheng, Zhi, Xu, Tong, Wu, Hua, Wang, Haifeng, Chen, Enhong
Recent Large Reasoning Models have achieved significant improvements in complex task-solving capabilities by allocating more computation at the inference stage with a "thinking longer" paradigm. Even as the foundational reasoning capabilities of models advance rapidly, the persistent gap between a model's performance in a single attempt and its latent potential, often revealed only across multiple solution paths, starkly highlights the disparity between its realized and inherent capabilities. To address this, we present A2R, an Asymmetric Two-Stage Reasoning framework designed to explicitly bridge the gap between a model's potential and its actual performance. In this framework, an "explorer" model first generates potential solutions in parallel through repeated sampling. Subsequently,a "synthesizer" model integrates these references for a more refined, second stage of reasoning. This two-stage process allows computation to be scaled orthogonally to existing sequential methods. Our work makes two key innovations: First, we present A2R as a plug-and-play parallel reasoning framework that explicitly enhances a model's capabilities on complex questions. For example, using our framework, the Qwen3-8B-distill model achieves a 75% performance improvement compared to its self-consistency baseline. Second, through a systematic analysis of the explorer and synthesizer roles, we identify an effective asymmetric scaling paradigm. This insight leads to A2R-Efficient, a "small-to-big" variant that combines a Qwen3-4B explorer with a Qwen3-8B synthesizer. This configuration surpasses the average performance of a monolithic Qwen3-32B model at a nearly 30% lower cost. Collectively, these results show that A2R is not only a performance-boosting framework but also an efficient and practical solution for real-world applications.
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Asia > Singapore (0.04)
- Asia > Middle East > Jordan (0.04)
- (4 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
- North America > United States > California (1.00)
- Europe (1.00)
- Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
Lee, Seungjae, Ekpo, Daniel, Liu, Haowen, Huang, Furong, Shrivastava, Abhinav, Huang, Jia-Bin
Exploration is essential for general-purpose robotic learning, especially in open-ended environments where dense rewards, explicit goals, or task-specific supervision are scarce. Vision-language models (VLMs), with their semantic reasoning over objects, spatial relations, and potential outcomes, present a compelling foundation for generating high-level exploratory behaviors. However, their outputs are often ungrounded, making it difficult to determine whether imagined transitions are physically feasible or informative. To bridge the gap between imagination and execution, we present IVE (Imagine, Verify, Execute), an agentic exploration framework inspired by human curiosity. Human exploration is often driven by the desire to discover novel scene configurations and to deepen understanding of the environment. Similarly, IVE leverages VLMs to abstract RGB-D observations into semantic scene graphs, imagine novel scenes, predict their physical plausibility, and generate executable skill sequences through action tools. We evaluate IVE in both simulated and real-world tabletop environments. The results show that IVE enables more diverse and meaningful exploration than RL baselines, as evidenced by a 4.1 to 7.8x increase in the entropy of visited states. Moreover, the collected experience supports downstream learning, producing policies that closely match or exceed the performance of those trained on human-collected demonstrations.
- North America > United States > Maryland > Prince George's County > College Park (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)