euclid
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Kang, Minki, Jeong, Jongwon, Lee, Seanie, Cho, Jaewoong, Hwang, Sung Ju
Large language models (LLMs) excel at complex reasoning tasks but remain computationally expensive, limiting their practical deployment. To address this, recent works have focused on distilling reasoning capabilities into smaller language models (sLMs) using chain-of-thought (CoT) traces from teacher LLMs. However, this approach struggles in scenarios requiring rare factual knowledge or precise computation, where sLMs often hallucinate due to limited capability. In this work, we propose Agent Distillation, a framework for transferring not only reasoning capability but full task-solving behavior from LLM-based agents into sLMs with retrieval and code tools. We improve agent distillation along two complementary axes: (1) we introduce a prompting method called first-thought prefix to enhance the quality of teacher-generated trajectories; and (2) we propose a self-consistent action generation for improving test-time robustness of small agents. We evaluate our method on eight reasoning tasks across factual and mathematical domains, covering both in-domain and out-of-domain generalization. Our results show that sLMs as small as 0.5B, 1.5B, 3B parameters can achieve performance competitive with next-tier larger 1.5B, 3B, 7B models fine-tuned using CoT distillation, demonstrating the potential of agent distillation for building practical, tool-using small agents. Our code is available at https://github.com/Nardien/agent-distillation.
- North America > United States > Ohio > Cuyahoga County > Euclid (0.14)
- Europe > Austria > Vienna (0.14)
- North America > United States > Florida > Miami-Dade County > Miami (0.14)
- (18 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Education (1.00)
- Information Technology (0.67)
AI's Euclid's Elements Moment: From Language Models to Computable Thought
Fang, Xinmin, Tao, Lingfeng, Li, Zhengxiong
This paper presents a comprehensive five-stage evolutionary framework for understanding the development of artificial intelligence, arguing that its trajectory mirrors the historical progression of human cognitive technologies. We posit that AI is advancing through distinct epochs, each defined by a revolutionary shift in its capacity for representation and reasoning, analogous to the inventions of cuneiform, the alphabet, grammar and logic, mathematical calculus, and formal logical systems. This "Geometry of Cognition" framework moves beyond mere metaphor to provide a systematic, cross-disciplinary model that not only explains AI's past architectural shifts-from expert systems to Transformers-but also charts a concrete and prescriptive path forward. Crucially, we demonstrate that this evolution is not merely linear but reflexive: as AI advances through these stages, the tools and insights it develops create a feedback loop that fundamentally reshapes its own underlying architecture. We are currently transitioning into a "Metalinguistic Moment," characterized by the emergence of self-reflective capabilities like Chain-of-Thought prompting and Constitutional AI. The subsequent stages, the "Mathematical Symbolism Moment" and the "Formal Logic System Moment," will be defined by the development of a computable calculus of thought, likely through neuro-symbolic architectures and program synthesis, culminating in provably aligned and reliable AI that reconstructs its own foundational representations. This work serves as the methodological capstone to our trilogy, which previously explored the economic drivers ("why") and cognitive nature ("what") of AI. Here, we address the "how," providing a theoretical foundation for future research and offering concrete, actionable strategies for startups and developers aiming to build the next generation of intelligent systems.
- North America > United States > Colorado > Denver County > Denver (0.14)
- Europe > Greece (0.04)
- North America > United States > Louisiana (0.04)
- (2 more...)
- Health & Medicine (0.93)
- Law (0.68)
- Transportation (0.68)
Discrete Speech Unit Extraction via Independent Component Analysis
Nakamura, Tomohiko, Choi, Kwanghee, Hojo, Keigo, Bando, Yoshiaki, Fukayama, Satoru, Watanabe, Shinji
Self-supervised speech models (S3Ms) have become a common tool for the speech processing community, leveraging representations for downstream tasks. Clustering S3M representations yields discrete speech units (DSUs), which serve as compact representations for speech signals. DSUs are typically obtained by k-means clustering. Using DSUs often leads to strong performance in various tasks, including automatic speech recognition (ASR). However, even with the high dimensionality and redundancy of S3M representations, preprocessing S3M representations for better clustering remains unexplored, even though it can affect the quality of DSUs. In this paper, we investigate the potential of linear preprocessing methods for extracting DSUs. We evaluate standardization, principal component analysis, whitening, and independent component analysis (ICA) on DSU-based ASR benchmarks and demonstrate their effectiveness as preprocessing for k-means. We also conduct extensive analyses of their behavior, such as orthogonality or interpretability of individual components of ICA.
- North America > United States (0.28)
- Asia > Japan (0.14)
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions
Zhang, Jiarui, Liu, Ollie, Yu, Tianyu, Hu, Jinyi, Neiswanger, Willie
Multimodal large language models (MLLMs) have made rapid progress in recent years, yet continue to struggle with low-level visual perception (LLVP) -- particularly the ability to accurately describe the geometric details of an image. This capability is crucial for applications in areas such as robotics, medical image analysis, and manufacturing. In this paper, we first introduce Geoperception, a benchmark designed to evaluate an MLLM's ability to accurately transcribe 2D geometric information from an image. Using this benchmark, we demonstrate the limitations of leading MLLMs, and then conduct a comprehensive empirical study to explore strategies for improving their performance on geometric tasks. Our findings highlight the benefits of certain model architectures, training techniques, and data strategies, including the use of high-fidelity synthetic data and multi-stage training with a data curriculum. Notably, we find that a data curriculum enables models to learn challenging geometry understanding tasks which they fail to learn from scratch. Leveraging these insights, we develop Euclid, a family of models specifically optimized for strong low-level geometric perception. Although purely trained on synthetic multimodal data, Euclid shows strong generalization ability to novel geometry shapes. For instance, Euclid outperforms the best closed-source model, Gemini-1.5-Pro, by up to 58.56% on certain Geoperception benchmark tasks and 10.65% on average across all tasks.
- North America > United States > California (0.14)
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > Canada > Manitoba > Westman Region > Brandon (0.04)
- Education (0.46)
- Health & Medicine (0.34)
Machine learning-driven Anomaly Detection and Forecasting for Euclid Space Telescope Operations
Gómez, Pablo, Vavrek, Roland D., Buenadicha, Guillermo, Hoar, John, Kruk, Sandor, Reerink, Jan
State-of-the-art space science missions increasingly rely on automation due to spacecraft complexity and the costs of human oversight. The high volume of data, including scientific and telemetry data, makes manual inspection challenging. Machine learning offers significant potential to meet these demands. The Euclid space telescope, in its survey phase since February 2024, exemplifies this shift. Euclid's success depends on accurate monitoring and interpretation of housekeeping telemetry and science-derived data. Thousands of telemetry parameters, monitored as time series, may or may not impact the quality of scientific data. These parameters have complex interdependencies, often due to physical relationships (e.g., proximity of temperature sensors). Optimising science operations requires careful anomaly detection and identification of hidden parameter states. Moreover, understanding the interactions between known anomalies and physical quantities is crucial yet complex, as related parameters may display anomalies with varied timing and intensity. We address these challenges by analysing temperature anomalies in Euclid's telemetry from February to August 2024, focusing on eleven temperature parameters and 35 covariates. We use a predictive XGBoost model to forecast temperatures based on historical values, detecting anomalies as deviations from predictions. A second XGBoost model predicts anomalies from covariates, capturing their relationships to temperature anomalies. We identify the top three anomalies per parameter and analyse their interactions with covariates using SHAP (Shapley Additive Explanations), enabling rapid, automated analysis of complex parameter relationships. Our method demonstrates how machine learning can enhance telemetry monitoring, offering scalable solutions for other missions with similar data challenges.
Euclid: Identification of asteroid streaks in simulated images using deep learning
Pöntinen, M., Granvik, M., Nucita, A. A., Conversi, L., Altieri, B., Carry, B., O'Riordan, C. M., Scott, D., Aghanim, N., Amara, A., Amendola, L., Auricchio, N., Baldi, M., Bonino, D., Branchini, E., Brescia, M., Camera, S., Capobianco, V., Carbone, C., Carretero, J., Castellano, M., Cavuoti, S., Cimatti, A., Cledassou, R., Congedo, G., Copin, Y., Corcione, L., Courbin, F., Cropper, M., Da Silva, A., Degaudenzi, H., Dinis, J., Dubath, F., Dupac, X., Dusini, S., Farrens, S., Ferriol, S., Frailis, M., Franceschi, E., Fumana, M., Galeotta, S., Garilli, B., Gillard, W., Gillis, B., Giocoli, C., Grazian, A., Haugan, S. V. H., Holmes, W., Hormuth, F., Hornstrup, A., Jahnke, K., Kümmel, M., Kermiche, S., Kiessling, A., Kitching, T., Kohley, R., Kunz, M., Kurki-Suonio, H., Ligori, S., Lilje, P. B., Lloro, I., Maiorano, E., Mansutti, O., Marggraf, O., Markovic, K., Marulli, F., Massey, R., Medinaceli, E., Mei, S., Melchior, M., Mellier, Y., Meneghetti, M., Meylan, G., Moresco, M., Moscardini, L., Munari, E., Niemi, S. -M., Nutma, T., Padilla, C., Paltani, S., Pasian, F., Pedersen, K., Pettorino, V., Pires, S., Polenta, G., Poncet, M., Raison, F., Renzi, A., Rhodes, J., Riccio, G., Romelli, E., Roncarelli, M., Rossetti, E., Saglia, R., Sapone, D., Sartoris, B., Schneider, P., Secroun, A., Seidel, G., Serrano, S., Sirignano, C., Sirri, G., Stanco, L., Tallada-Crespí, P., Taylor, A. N., Tereno, I., Toledo-Moreo, R., Torradeflot, F., Tutusaus, I., Valenziano, L., Vassallo, T., Kleijn, G. Verdoes, Wang, Y., Weller, J., Zamorani, G., Zoubian, J., Scottez, V.
Up to 150000 asteroids will be visible in the images of the ESA Euclid space telescope, and the instruments of Euclid offer multiband visual to near-infrared photometry and slitless spectra of these objects. Most asteroids will appear as streaks in the images. Due to the large number of images and asteroids, automated detection methods are needed. A non-machine-learning approach based on the StreakDet software was previously tested, but the results were not optimal for short and/or faint streaks. We set out to improve the capability to detect asteroid streaks in Euclid images by using deep learning. We built, trained, and tested a three-step machine-learning pipeline with simulated Euclid images. First, a convolutional neural network (CNN) detected streaks and their coordinates in full images, aiming to maximize the completeness (recall) of detections. Then, a recurrent neural network (RNN) merged snippets of long streaks detected in several parts by the CNN. Lastly, gradient-boosted trees (XGBoost) linked detected streaks between different Euclid exposures to reduce the number of false positives and improve the purity (precision) of the sample. The deep-learning pipeline surpasses the completeness and reaches a similar level of purity of a non-machine-learning pipeline based on the StreakDet software. Additionally, the deep-learning pipeline can detect asteroids 0.25-0.5 magnitudes fainter than StreakDet. The deep-learning pipeline could result in a 50% increase in the number of detected asteroids compared to the StreakDet software. There is still scope for further refinement, particularly in improving the accuracy of streak coordinates and enhancing the completeness of the final stage of the pipeline, which involves linking detections across multiple exposures.
- Europe > United Kingdom (0.14)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.05)
- Europe > Portugal > Lisbon > Lisbon (0.04)
- (28 more...)
- Government > Space Agency (0.46)
- Government > Regional Government (0.45)
Quantum computing is the key to consciousness
With the rapid development of chatbots and other AI systems, questions about whether they will ever gain true understanding, become conscious, or even develop a feeling agency have become more pressing. When it comes to making sense of these qualities in humans, our ability for counterfactual thinking is key. The existence of alternative worlds where things happen differently, however, is not just an exercise in imagination – it's a key prediction of quantum mechanics. Perhaps our brains are able to ponder how things could have been because in essence they are quantum computers, accessing information from alternative worlds, argues Tim Palmer. Ask a chatbot "How many prime numbers are there?"[i] Ask the chatbot "How do we know?" and it will reply that there are many ways to show this, the original going back to the mathematician Euclid of ancient Greece.
- Europe > Greece (0.24)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yuan, Yifu, Hao, Jianye, Ni, Fei, Mu, Yao, Zheng, Yan, Hu, Yujing, Liu, Jinyi, Chen, Yingfeng, Fan, Changjie
Unsupervised reinforcement learning (URL) poses a promising paradigm to learn useful behaviors in a task-agnostic environment without the guidance of extrinsic rewards to facilitate the fast adaptation of various downstream tasks. Previous works focused on the pre-training in a model-free manner while lacking the study of transition dynamics modeling that leaves a large space for the improvement of sample efficiency in downstream tasks. To this end, we propose an Efficient Unsupervised Reinforcement Learning Framework with Multi-choice Dynamics model (EUCLID), which introduces a novel model-fused paradigm to jointly pre-train the dynamics model and unsupervised exploration policy in the pre-training phase, thus better leveraging the environmental samples and improving the downstream task sampling efficiency. However, constructing a generalizable model which captures the local dynamics under different behaviors remains a challenging problem. We introduce the multi-choice dynamics model that covers different local dynamics under different behaviors concurrently, which uses different heads to learn the state transition under different behaviors during unsupervised pre-training and selects the most appropriate head for prediction in the downstream task. Experimental results in the manipulation and locomotion domains demonstrate that EUCLID achieves state-of-the-art performance with high sample efficiency, basically solving the state-based URLB benchmark and reaching a mean normalized score of 104.0$\pm$1.2$\%$ in downstream tasks with 100k fine-tuning steps, which is equivalent to DDPG's performance at 2M interactive steps with 20x more data.
- North America > United States (0.46)
- Asia > China (0.46)
How Breaking the Rules of Math Will Give Us an Edge Over AI
It is our human nature to tinker with rules, as anyone who has argued over dubious interpretations of the game Monopoly will attest. Rule-breaking may hold the key to giving us an edge over machines that, for all their emerging capabilities, are tied to fixed directives. A close look at how mathematics comes to be demonstrates the power of departing from accepted convention, mathematician and author Junaid Mubeen writes in Mathematical Intelligence: A Story of Human Superiority Over Machines.How Humans have long been dreaming up concepts and creatures that exist outside our physical reality. The oldest known figurative art object, discovered in a cave in the Lone Valley of south-western Germany, is the Lion Man of Hohlenstein-Stadel, a chimeric figurine that is half-human, half-lion. Sculpted around 40,000 years ago for purposes unknown, the Lion Man is a product of pure human imagination.