Goto

Collaborating Authors

 compaction


Operand Quant: A Single-Agent Architecture for Autonomous Machine Learning Engineering

arXiv.org Artificial Intelligence

We present Operand Quant, a single-agent, IDE-based architecture for autonomous machine learning engineering (MLE). Operand Quant departs from conventional multi-agent orchestration frameworks by consolidating all MLE lifecycle stages -- exploration, modeling, experimentation, and deployment -- within a single, context-aware agent. On the MLE-Benchmark (2025), Operand Quant achieved a new state-of-the-art (SOTA) result, with an overall medal rate of 0.3956 +/- 0.0565 across 75 problems -- the highest recorded performance among all evaluated systems to date. The architecture demonstrates that a linear, non-blocking agent, operating autonomously within a controlled IDE environment, can outperform multi-agent and orchestrated systems under identical constraints.


Integrating Intent Understanding and Optimal Behavior Planning for Behavior Tree Generation from Human Instructions

arXiv.org Artificial Intelligence

Robots executing tasks following human instructions in domestic or industrial environments essentially require both adaptability and reliability. Behavior Tree (BT) emerges as an appropriate control architecture for these scenarios due to its modularity and reactivity. Existing BT generation methods, however, either do not involve interpreting natural language or cannot theoretically guarantee the BTs' success. This paper proposes a two-stage framework for BT generation, which first employs large language models (LLMs) to interpret goals from high-level instructions, then constructs an efficient goal-specific BT through the Optimal Behavior Tree Expansion Algorithm (OBTEA). We represent goals as well-formed formulas in first-order logic, effectively bridging intent understanding and optimal behavior planning. Experiments in the service robot validate the proficiency of LLMs in producing grammatically correct and accurately interpreted goals, demonstrate OBTEA's superiority over the baseline BT Expansion algorithm in various metrics, and finally confirm the practical deployability of our framework. The project website is https://dids-ei.github.io/Project/LLM-OBTEA/.


Transformer Based Model for Predicting Rapid Impact Compaction Outcomes: A Case Study of Utapao International Airport

arXiv.org Artificial Intelligence

It is often used in large infrastructure projects such as airports and highways, where the soil needs to support the weight of the structure and pavement (Cheng et al. 2021; Mohammed et al. 2013; Simpson et al. 2008; Spyropoulos et al. 2020; Tarawneh and Matraji 2014; Vukadin 2013). The effectiveness of RIC depends on various factors, such as the fine content of the soil, the compaction sequence, the energy applied, the stiffness of existing ground, the ground water characteristics and the soil drainage. These factors vary in different site conditions and need to be considered in the design of RIC to optimize the compaction method (Ghanbari and Hamidi 2014; Serridge and Synac 2006; Tarawneh and Matraji 2014). Therefore, it is recommended to conduct a trial before the actual construction. Predicting the engineering properties of the ground improved by Rapid Impact Compaction (RIC) is a challenging task for geotechnical engineers.


Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees

arXiv.org Artificial Intelligence

An $\varepsilon$-approximate quantile sketch over a stream of $n$ inputs approximates the rank of any query point $q$ - that is, the number of input points less than $q$ - up to an additive error of $\varepsilon n$, generally with some probability of at least $1 - 1/\mathrm{poly}(n)$, while consuming $o(n)$ space. While the celebrated KLL sketch of Karnin, Lang, and Liberty achieves a provably optimal quantile approximation algorithm over worst-case streams, the approximations it achieves in practice are often far from optimal. Indeed, the most commonly used technique in practice is Dunning's t-digest, which often achieves much better approximations than KLL on real-world data but is known to have arbitrarily large errors in the worst case. We apply interpolation techniques to the streaming quantiles problem to attempt to achieve better approximations on real-world data sets than KLL while maintaining similar guarantees in the worst case.


Optimality-based Analysis of XCSF Compaction in Discrete Reinforcement Learning

arXiv.org Machine Learning

Learning classifier systems (LCSs) are population-based predictive systems that were originally envisioned as agents to act in reinforcement learning (RL) environments. These systems can suffer from population bloat and so are amenable to compaction techniques that try to strike a balance between population size and performance. A well-studied LCS architecture is XCSF, which in the RL setting acts as a Q-function approximator. We apply XCSF to a deterministic and stochastic variant of the FrozenLake8x8 environment from OpenAI Gym, with its performance compared in terms of function approximation error and policy accuracy to the optimal Q-functions and policies produced by solving the environments via dynamic programming. We then introduce a novel compaction algorithm (Greedy Niche Mass Compaction - GNMC) and study its operation on XCSF's trained populations. Results show that given a suitable parametrisation, GNMC preserves or even slightly improves function approximation error while yielding a significant reduction in population size. Reasonable preservation of policy accuracy also occurs, and we link this metric to the commonly used steps-to-goal metric in maze-like environments, illustrating how the metrics are complementary rather than competitive.


Discrepancy, Coresets, and Sketches in Machine Learning

arXiv.org Machine Learning

This paper defines the notion of class discrepancy for families of functions. It shows that low discrepancy classes admit small offline and streaming coresets. We provide general techniques for bounding the class discrepancy of machine learning problems. As corollaries of the general technique we bound the discrepancy (and therefore coreset complexity) of logistic regression, sigmoid activation loss, matrix covariance, kernel density and any analytic function of the dot product or the squared distance. Our results prove the existence of epsilon-approximation O(sqrt{d}/epsilon) sized coresets for the above problems. This resolves the long-standing open problem regarding the coreset complexity of Gaussian kernel density estimation. We provide two more related but independent results. First, an exponential improvement of the widely used merge-and-reduce trick which gives improved streaming sketches for any low discrepancy problem. Second, an extremely simple deterministic algorithm for finding low discrepancy sequences (and therefore coresets) for any positive semi-definite kernel. This paper establishes some explicit connections between class discrepancy, coreset complexity, learnability, and streaming algorithms.


Hands-free farming using autonomous tractors and drones

Daily Mail - Science & tech

A team of agricultural engineers are attempting a world-first of growing and harvesting a field of cereal crop without a human setting foot on the land. Researchers have pioneered an autonomous tractor which can be steered by a farmer from a control room to carry out the drilling, seeding and spraying of the land. Then an automated combine harvester will harvest the field in the ground-breaking project. Researchers have pioneered an autonomous tractor which can be steered by a farmer from a control room to carry out the drilling, seeding and spraying of the land. Drones are also being used to monitor the crops so agronomists don't have to enter the field to carry out their observations.