AITopics | octree

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Neural Information Processing SystemsMar-21-2026, 23:27:21 GMT

In tabular prediction tasks, tree-based models combined with automated feature engineering methods often outperform deep learning approaches that rely on learned representations. While these feature engineering techniques are effective, they typically depend on a pre-defined search space and primarily use validation scores for feature selection, thereby missing valuable insights from previous experiments.To address these limitations, we propose a novel tabular learning framework that utilizes large language models (LLMs), termed Optimizing Column feature generator with decision Tree reasoning (OCTree). Our key idea is to leverage the reasoning capabilities of LLMs to identify effective feature generation rules without manually specifying the search space and provide language-based reasoning information highlighting past experiments as feedback for iterative rule improvements. We use decision trees to convey this reasoning information, as they can be easily represented in natural language, effectively providing knowledge from prior experiments (i.e., the impact of the generated features on performance) to the LLMs. Our empirical results demonstrate that OCTree consistently enhances the performance of various prediction models across diverse benchmarks, outperforming competing automated feature engineering methods.

Add feedback

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Neural Information Processing SystemsFeb-17-2026, 06:30:45 GMT

We use decision trees to convey this reasoning information, as they can be easily represented in natural language, effectively providing knowledge from prior experiments ( i.e., the impact of the generated features on performance) to

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
North America > Montserrat (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

MuSCLE: MultiSweepCompressionofLiDARusing DeepEntropyModels

Neural Information Processing SystemsFeb-11-2026, 05:42:38 GMT

We present a novel compression algorithm for reducing the storage of LiDAR sensor data streams.

artificial intelligence, deep learning, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Asia > South Korea > Seoul > Seoul (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Technology:

Information Technology > Sensing and Signal Processing (0.89)
Information Technology > Artificial Intelligence > Robots (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

$\nabla$-SDF: Learning Euclidean Signed Distance Functions Online with Gradient-Augmented Octree Interpolation and Neural Residual

Dai, Zhirui, Qian, Qihao, Fan, Tianxing, Atanasov, Nikolay

arXiv.org Artificial IntelligenceOct-23-2025

Abstract-- Estimation of signed distance functions (SDFs) from point cloud data has been shown to benefit many robot autonomy capabilities, including localization, mapping, motion planning, and control. Methods that support online and large-scale SDF reconstruction tend to rely on discrete volumetric data structures, which affect the continuity and differentiability of the SDF estimates. Recently, using implicit features, neural network methods have demonstrated high-fidelity and differentiable SDF reconstruction but they tend to be less efficient, can experience catastrophic forgetting and memory limitations in large environments, and are often restricted to truncated SDFs. This work proposes -SDF, a hybrid method that combines an explicit prior obtained from gradient-augmented octree interpolation with an implicit neural residual. Our method achieves non-truncated (Euclidean) SDF reconstruction with computational and memory efficiency comparable to volumetric methods and differentiability and accuracy comparable to neural network methods. Extensive experiments demonstrate that -SDF outperforms the state of the art in terms of accuracy and efficiency, providing a scalable solution for downstream tasks in robotics and computer vision. Accurate and differentiable geometric environment representations are critical for many functions in robot autonomy and computer vision, including simultaneous localization and mapping [1]-[3], rendering and AR/VR [4]-[6], autonomous navigation [7], [8] and manipulation [9]-[11].

artificial intelligence, interpolation, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2510.18999

Country: North America > United States > California (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.54)

Add feedback

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Neural Information Processing SystemsOct-10-2025, 12:34:01 GMT

We use decision trees to convey this reasoning information, as they can be easily represented in natural language, effectively providing knowledge from prior experiments ( i.e., the impact of the generated features on performance) to

dataset, llm, octree, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
North America > Montserrat (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

A Preview of HoloOcean 2.0

Romrell, Blake, Austin, Abigail, Meyers, Braden, Anderson, Ryan, Noh, Carter, Mangelson, Joshua G.

arXiv.org Artificial IntelligenceOct-8-2025

Abstract-- Marine robotics simulators play a fundamental role in the development of marine robotic systems. With increased focus on the marine robotics field in recent years, there has been significant interest in developing higher fidelity simulation of marine sensors, physics, and visual rendering capabilities to support autonomous marine robot development and validation. HoloOcean 2.0, the next major release of HoloOcean, brings state-of-the-art features under a general marine simulator capable of supporting a variety of tasks. New features in HoloOcean 2.0 include migration to Unreal Engine (UE) 5.3, advanced vehicle dynamics using models from Fossen, and support for ROS2 using a custom bridge. Additional features are currently in development, including significantly more efficient ray tracing-based sidescan, forward-looking, and bathymetric sonar implementations; semantic sensors; environment generation tools; volumetric environmental effects; and realistic waves. Marine robotics simulators have supported research and development for autonomous underwater and surface vessels for several decades.

artificial intelligence, holoocean, implementation, (18 more...)

arXiv.org Artificial Intelligence

2510.0616

Country: Europe (0.28)

Genre: Research Report (0.40)

Industry: Government (0.48)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models

Neural Information Processing SystemsAug-17-2025, 09:46:18 GMT

We present a novel compression algorithm for reducing the storage of LiDAR sensor data streams.

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
(8 more...)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Neural Information Processing SystemsMay-27-2025, 11:53:28 GMT

In tabular prediction tasks, tree-based models combined with automated feature engineering methods often outperform deep learning approaches that rely on learned representations. While these feature engineering techniques are effective, they typically depend on a pre-defined search space and primarily use validation scores for feature selection, thereby missing valuable insights from previous experiments.To address these limitations, we propose a novel tabular learning framework that utilizes large language models (LLMs), termed Optimizing Column feature generator with decision Tree reasoning (OCTree). Our key idea is to leverage the reasoning capabilities of LLMs to identify effective feature generation rules without manually specifying the search space and provide language-based reasoning information highlighting past experiments as feedback for iterative rule improvements. We use decision trees to convey this reasoning information, as they can be easily represented in natural language, effectively providing knowledge from prior experiments (i.e., the impact of the generated features on performance) to the LLMs. Our empirical results demonstrate that OCTree consistently enhances the performance of various prediction models across diverse benchmarks, outperforming competing automated feature engineering methods.

Add feedback

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Iwase, Shun, Irshad, Zubair, Liu, Katherine, Guizilini, Vitor, Lee, Robert, Ikeda, Takuya, Amma, Ayako, Nishiwaki, Koichi, Kitani, Kris, Ambrus, Rares, Zakharov, Sergey

arXiv.org Artificial IntelligenceApr-16-2025

Robotic grasping is a cornerstone capability of embodied systems. Many methods directly output grasps from partial information without modeling the geometry of the scene, leading to suboptimal motion and even collisions. To address these issues, we introduce ZeroGrasp, a novel framework that simultaneously performs 3D reconstruction and grasp pose prediction in near real-time. A key insight of our method is that occlusion reasoning and modeling the spatial relationships between objects is beneficial for both accurate reconstruction and grasping. We couple our method with a novel large-scale synthetic dataset, which comprises 1M photo-realistic images, high-resolution 3D reconstructions and 11.3B physically-valid grasp pose annotations for 12K objects from the Objaverse-LVIS dataset. We evaluate ZeroGrasp on the GraspNet-1B benchmark as well as through real-world robot experiments. ZeroGrasp achieves state-of-the-art performance and generalizes to novel real-world objects by leveraging synthetic data.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2504.10857

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Nam, Jaehyun, Kim, Kyuyoung, Oh, Seunghyuk, Tack, Jihoon, Kim, Jaehyung, Shin, Jinwoo

arXiv.org Artificial IntelligenceJun-12-2024

Learning effective representations from raw data is crucial for the success of deep learning methods. However, in the tabular domain, practitioners often prefer augmenting raw column features over using learned representations, as conventional tree-based algorithms frequently outperform competing approaches. As a result, feature engineering methods that automatically generate candidate features have been widely used. While these approaches are often effective, there remains ambiguity in defining the space over which to search for candidate features. Moreover, they often rely solely on validation scores to select good features, neglecting valuable feedback from past experiments that could inform the planning of future experiments. To address the shortcomings, we propose a new tabular learning framework based on large language models (LLMs), coined Optimizing Column feature generator with decision Tree reasoning (OCTree). Our key idea is to leverage LLMs' reasoning capabilities to find good feature generation rules without manually specifying the search space and provide language-based reasoning information highlighting past experiments as feedback for iterative rule improvements. Here, we choose a decision tree as reasoning as it can be interpreted in natural language, effectively conveying knowledge of past experiments (i.e., the prediction models trained with the generated features) to the LLM. Our empirical results demonstrate that this simple framework consistently enhances the performance of various prediction models across diverse tabular benchmarks, outperforming competing automatic feature engineering methods.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2406.08527

Country:

North America > United States > California (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
North America > Montserrat (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

octree

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

MuSCLE: MultiSweepCompressionofLiDARusing DeepEntropyModels

$\nabla$-SDF: Learning Euclidean Signed Distance Functions Online with Gradient-Augmented Octree Interpolation and Neural Residual

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

A Preview of HoloOcean 2.0

MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning