AITopics | carte

carte

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Greedy Feature Construction

Neural Information Processing Systems, Nov-21-2025, 05:06:13 GMT

We present an effective method for supervised feature construction.

artificial intelligence, machine learning, sequence, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

TabGemma: Text-Based Tabular ICL via LLM using Continued Pretraining and Retrieval

Schindler, Günther, Schambach, Maximilian, Medek, Michael, Thelin, Sam

arXiv.org Artificial Intelligence, Nov-6-2025

We study LLMs for tabular prediction with mixed text, numeric, and categorical fields. We introduce TabGemma, a schema-agnostic in-context learner that treats rows as sequences and tackles two practical hurdles when adapting pretrained LLMs for tabular predictions: unstable numeric tokenization and limited context size. We propose to canonicalize numbers via signed scientific notation and continue pretraining of a 12B Gemma 3 model with a target imputation objective using a large-scale real world dataset. For inference, we use a compact n-gram-based retrieval to select informative exemplars that fit within a 128k-token window. On semantically rich benchmarks, TabGemma establishes a new state of the art on classification across low- and high-data regimes and improves monotonically with more context rows. For regression, it is competitive at small sample sizes but trails conventional approaches as data grows. Our results show that LLMs can be effective tabular in-context learners on highly semantic tasks when paired with dedicated numeric handling and context retrieval, while motivating further advances in numeric modeling and long-context scaling.

benchmark, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2511.0357

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
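
The number canonicalization mentioned in the TabGemma abstract can be illustrated with a minimal sketch. The abstract only says that numbers are written in signed scientific notation; the exact layout below (four significant digits, a two-digit signed exponent), the `name: value | ...` row format, and the function names `canonicalize_number` and `serialize_row` are assumptions made for illustration, not the authors' implementation.

```python
def canonicalize_number(value: float, digits: int = 4) -> str:
    """Write a number with an explicit sign on both mantissa and exponent.

    Assumed format (not taken from the paper): <sign>d.ddde<sign>XX
    """
    mantissa, exponent = f"{value:.{digits - 1}e}".split("e")
    sign = "-" if mantissa.startswith("-") else "+"
    return f"{sign}{mantissa.lstrip('+-')}e{int(exponent):+03d}"


def serialize_row(row: dict) -> str:
    """Turn one table row into a text sequence (hypothetical layout)."""
    parts = []
    for name, value in row.items():
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        text = canonicalize_number(float(value)) if is_number else str(value)
        parts.append(f"{name}: {text}")
    return " | ".join(parts)


print(serialize_row({"city": "Paris", "price": 1234, "delta": -0.00056}))
```

Writing every number with the same fixed-width layout is one plausible way to make tokenization more stable, since values of very different magnitude then share the same character pattern.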

Towards Benchmarking Foundation Models for Tabular Data With Text

Mráz, Martin, Das, Breenda, Gupta, Anshul, Purucker, Lennart, Hutter, Frank

arXiv.org Artificial Intelligence, Jul-11-2025

Foundation models for tabular data are rapidly evolving, with increasing interest in extending them to support additional modalities such as free-text features. However, existing benchmarks for tabular data rarely include textual columns, and identifying real-world tabular datasets with semantically rich text features is non-trivial. We propose a series of simple yet effective ablation-style strategies for incorporating text into conventional tabular pipelines. Moreover, we benchmark how state-of-the-art tabular foundation models can handle textual data by manually curating a collection of real-world tabular datasets with meaningful textual features. Our study is an important step towards improving benchmarking of foundation models for tabular data with text.

data mining, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2507.07829

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Asia > Pakistan (0.04)
Europe > Italy (0.04)
(5 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Leisure & Entertainment (1.00)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Photometric Stereo using Gaussian Splatting and inverse rendering

Ducastel, Matéo, Tschumperlé, David, Quéau, Yvain

arXiv.org Artificial Intelligence, Jul-10-2025

Recent state-of-the-art algorithms in photometric stereo rely on neural networks and operate either through prior learning or inverse rendering optimization. Here, we revisit the problem of calibrated photometric stereo by leveraging recent advances in 3D inverse rendering using the Gaussian Splatting formalism. This allows us to parameterize the 3D scene to be reconstructed and optimize it in a more interpretable manner. Our approach incorporates a simplified model for light representation and demonstrates the potential of the Gaussian Splatting rendering engine for the photometric stereo problem.

artificial intelligence, carte, gaussian splatting, (16 more...)

arXiv.org Artificial Intelligence

2507.06684

Country: Europe > France (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Vision (0.96)

CARTE: Pretraining and Transfer for Tabular Learning

Kim, Myung Jun, Grinsztajn, Léo, Varoquaux, Gaël

arXiv.org Artificial Intelligence, May-31-2024

Pretrained deep-learning models are the go-to solution for images or text. However, for tabular data the standard is still to train tree-based models. Indeed, transfer learning on tables hits the challenge of data integration: finding correspondences in the entries (entity matching), where different words may denote the same entity, and correspondences across columns (schema matching), which may come in different orders or under different names. We propose a neural architecture that does not need such correspondences. As a result, we can pretrain it on background data that has not been matched. The architecture -- CARTE for Context Aware Representation of Table Entries -- uses a graph representation of tabular (or relational) data to process tables with different columns, string embedding of entries and column names to model an open vocabulary, and a graph-attentional network to contextualize entries with column names and neighboring entries. An extensive benchmark shows that CARTE facilitates learning, outperforming a solid set of baselines including the best tree-based models. CARTE also enables joint learning across tables with unmatched columns, enhancing a small table with bigger ones. CARTE opens the door to large pretrained models for tabular data.

artificial intelligence, carte, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2402.16785

Country:

North America > United States > California (0.14)
Europe > Austria > Vienna (0.14)
Europe > France (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.67)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine (0.93)
Leisure & Entertainment > Sports (0.93)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
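
The graph representation described in the CARTE abstract can be pictured with a small sketch. It turns each row into a star-shaped graph whose edges carry column names, so rows from tables with different schemas share one format without any schema matching. The star shape, the `<row>` placeholder node, and the names `RowGraph` and `row_to_graph` are assumptions for illustration; the string embeddings and the graph-attentional network are left out.

```python
from dataclasses import dataclass


@dataclass
class RowGraph:
    nodes: list  # node 0 is a placeholder center standing for the whole row
    edges: list  # (center_index, entry_index, column_name) triples


def row_to_graph(row: dict) -> RowGraph:
    """Represent one table row as a star graph (hypothetical layout)."""
    nodes = ["<row>"]
    edges = []
    for column, value in row.items():
        if value is None:
            continue  # a missing entry contributes no node
        nodes.append(value)
        edges.append((0, len(nodes) - 1, column))
    return RowGraph(nodes, edges)


# Two rows from tables with unrelated columns map to the same structure.
wine = row_to_graph({"name": "Chablis", "region": "Burgundy", "price": None})
player = row_to_graph({"player": "Ada", "team": "Lyon", "goals": 12})
print(wine.edges)
print(player.edges)
```

In a fuller pipeline, each node value and each edge label would be replaced by a string embedding before being passed to a graph network; that part is not sketched here.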

Greedy Feature Construction

School of Computer Science Universität Bonn, Germany The University of Nottingham, UK

Neural Information Processing Systems, Mar-12-2024, 08:48:00 GMT

We present an effective method for supervised feature construction. The main goal of the approach is to construct a feature representation for which a set of linear hypotheses is of sufficient capacity - large enough to contain a satisfactory solution to the considered problem and small enough to allow good generalization from a small number of training examples. We achieve this goal with a greedy procedure that constructs features by empirically fitting squared error residuals. The proposed constructive procedure is consistent and can output a rich set of features. The effectiveness of the approach is evaluated empirically by fitting a linear ridge regression model in the constructed feature space and our empirical results indicate a superior performance of our approach over competing methods.

artificial intelligence, machine learning, sequence, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.86)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Bonn (0.40)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)

ScoreCAM GNN: une explication optimale des réseaux profonds sur graphes (ScoreCAM GNN: An Optimal Explanation of Deep Networks on Graphs)

Raison, Adrien, Bourdon, Pascal, Helbert, David

arXiv.org Artificial Intelligence, Jul-26-2022

The explainability of deep networks is becoming a central issue in the deep learning community. It is the same for learning on graphs, a data structure present in many real world problems. In this paper, we propose a method that is more optimal, lighter, consistent and better exploits the topology of the evaluated graph than the state-of-the-art methods.

artificial intelligence, carte, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2207.12748

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)

Genre: Research Report (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Pedagogical Agent Research at CARTE

AI Magazine, Jan-4-2018, 17:19:38 GMT

This article gives an overview of current research on animated pedagogical agents at the Center for Advanced Research in Technology for Education (CARTE) at the University of Southern California/Information Sciences Institute. Animated pedagogical agents, nicknamed guidebots, interact with learners to help keep learning activities on track. They combine the pedagogical expertise of intelligent tutoring systems with the interpersonal interaction capabilities of embodied conversational characters. They can support the acquisition of team skills as well as skills performed alone by individuals. At CARTE, we have been developing guidebots that help learners acquire a variety of problem-solving skills in virtual worlds, in multimedia environments, and on the web.

computer based training, educational technology, guidebot, (22 more...)

AI Magazine

Genre: Overview (1.00)

Industry: Education > Educational Technology > Educational Software > Computer Based Training (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbots (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Greedy Feature Construction

Oglic, Dino, Gärtner, Thomas

Neural Information Processing Systems, Dec-31-2016

We present an effective method for supervised feature construction. The main goal of the approach is to construct a feature representation for which a set of linear hypotheses is of sufficient capacity -- large enough to contain a satisfactory solution to the considered problem and small enough to allow good generalization from a small number of training examples. We achieve this goal with a greedy procedure that constructs features by empirically fitting squared error residuals. The proposed constructive procedure is consistent and can output a rich set of features. The effectiveness of the approach is evaluated empirically by fitting a linear ridge regression model in the constructed feature space and our empirical results indicate a superior performance of our approach over competing methods.

artificial intelligence, machine learning, sequence, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)
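
The greedy procedure outlined in the abstract above (construct features by fitting squared-error residuals, then fit ridge regression in the constructed space) can be sketched roughly as follows. The candidate family `cos(w . x + b)`, the random search over candidates, and all function names are assumptions chosen to keep the example short; the abstract does not specify how each feature is fitted.

```python
import numpy as np


def greedy_features(X, y, n_features=10, n_candidates=200, seed=0):
    """Greedily pick cosine features, each fitted to the current residual."""
    rng = np.random.default_rng(seed)
    residual = y - y.mean()
    params = []
    for _ in range(n_features):
        # Hypothetical candidate family: cos(w . x + b) with random w and b.
        W = rng.normal(size=(n_candidates, X.shape[1]))
        b = rng.uniform(0.0, 2.0 * np.pi, size=n_candidates)
        Phi = np.cos(X @ W.T + b)  # shape (n_samples, n_candidates)
        # Least-squares scale of every candidate against the residual.
        coef = (Phi.T @ residual) / (Phi ** 2).sum(axis=0)
        errors = ((residual[:, None] - Phi * coef) ** 2).sum(axis=0)
        best = int(np.argmin(errors))
        params.append((W[best], b[best]))
        # Remove what the chosen feature explains before the next round.
        residual = residual - coef[best] * Phi[:, best]
    return params


def transform(X, params):
    """Map inputs into the constructed feature space."""
    return np.column_stack([np.cos(X @ w + b) for w, b in params])


def ridge_fit(Phi, y, alpha=1e-3):
    """Closed-form ridge regression with an appended bias column."""
    A = np.column_stack([Phi, np.ones(len(Phi))])
    return np.linalg.solve(A.T @ A + alpha * np.eye(A.shape[1]), A.T @ y)


def predict(Phi, weights):
    return np.column_stack([Phi, np.ones(len(Phi))]) @ weights


rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))
y = np.sin(X[:, 0]) + 0.5 * X[:, 1]
Phi = transform(X, greedy_features(X, y))
weights = ridge_fit(Phi, y)
print("training MSE:", np.mean((predict(Phi, weights) - y) ** 2))
```

Because each step subtracts the least-squares projection of the residual onto the chosen feature, the squared residual norm cannot increase from one step to the next.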

Pedagogical Agent Research at CARTE

Johnson, W. Lewis

AI Magazine, Dec-15-2001

This article gives an overview of current research on animated pedagogical agents at the Center for Advanced Research in Technology for Education (CARTE) at the University of Southern California/Information Sciences Institute. Animated pedagogical agents, nicknamed guidebots, interact with learners to help keep learning activities on track. At CARTE, we have been developing guidebots that help learners acquire a variety of problem-solving skills in virtual worlds, in multimedia environments, and on the web. We are also developing technologies for creating interactive pedagogical dramas populated with guidebots and other autonomous animated characters.

artificial intelligence, pedagogical agent, survey article, (9 more...)

AI Magazine

Genre: Overview (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbots (0.99)
