
Collaborating Authors

Friede, David


AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents

arXiv.org Artificial Intelligence

The advances made by Large Language Models (LLMs) have led to the pursuit of LLM agents that can solve intricate, multi-step reasoning tasks. As with any research pursuit, benchmarking and evaluation are key cornerstones of efficient and reliable progress. However, existing benchmarks are often narrow and simply compute overall task success. To address these issues, we propose AgentQuest -- a framework where (i) both benchmarks and metrics are modular and easily extensible through well-documented and easy-to-use APIs, and (ii) we offer two new evaluation metrics that can reliably track LLM agent progress while solving a task. We exemplify the utility of the metrics on two use cases wherein we identify common failure points and refine the agent architecture to obtain a significant performance increase. Together with the research community, we hope to extend AgentQuest further, and therefore we make it available at https://github.com/nec-research/agentquest.
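To illustrate what a modular, extensible benchmark-and-metric setup of this kind can look like, here is a minimal sketch of a driver that steps an agent through a task while per-step metrics track progress. The class and method names (`Benchmark`, `Metric`, `step`, `run_episode`) are hypothetical and are not taken from the AgentQuest codebase; consult the repository for the actual interfaces.

```python
from abc import ABC, abstractmethod

class Benchmark(ABC):
    """Hypothetical benchmark interface: exposes observations and accepts agent actions."""
    @abstractmethod
    def reset(self) -> str: ...                            # initial observation
    @abstractmethod
    def step(self, action: str) -> tuple[str, bool]: ...   # (observation, done)

class Metric(ABC):
    """Hypothetical metric interface: updated once per agent step, reported at the end."""
    @abstractmethod
    def update(self, action: str, observation: str) -> None: ...
    @abstractmethod
    def report(self) -> float: ...

def run_episode(benchmark: Benchmark, agent, metrics: list[Metric], max_steps: int = 50):
    """Drive an agent through a benchmark while metrics track intermediate progress."""
    obs, done = benchmark.reset(), False
    for _ in range(max_steps):
        if done:
            break
        action = agent(obs)                     # any callable LLM agent
        obs, done = benchmark.step(action)
        for m in metrics:
            m.update(action, obs)
    return {type(m).__name__: m.report() for m in metrics}
```

The point of the sketch is the separation of concerns: a new benchmark or a new metric only needs to implement one small interface, while the driver loop stays unchanged.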


Efficient Learning of Discrete-Continuous Computation Graphs

arXiv.org Artificial Intelligence

Numerous models for supervised and reinforcement learning benefit from combinations of discrete and continuous model components. End-to-end learnable discrete-continuous models are compositional, tend to generalize better, and are more interpretable. A popular approach to building discrete-continuous computation graphs is to integrate discrete probability distributions into neural networks using stochastic softmax tricks. Prior work has mainly focused on computation graphs with a single discrete component on each of the graph's execution paths. We analyze the behavior of more complex stochastic computation graphs with multiple sequential discrete components. We show that it is challenging to optimize the parameters of these models, mainly due to small gradients and local minima. We then propose two new strategies to overcome these challenges. First, we show that increasing the scale parameter of the Gumbel noise perturbations during training improves the learning behavior. Second, we propose dropout residual connections specifically tailored to stochastic, discrete-continuous computation graphs. With an extensive set of experiments, we show that we can train complex discrete-continuous models which one cannot train with standard stochastic softmax tricks. We also show that complex discrete-stochastic models generalize better than their continuous counterparts on several benchmark datasets.
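The stochastic softmax trick at the core of this abstract can be illustrated with a small sketch: a Gumbel-softmax sample whose noise perturbations carry an explicit scale factor, chained through two sequential discrete components. The parameter names (`noise_scale`, `temperature`) and the toy computation graph are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def gumbel_softmax_sample(logits, temperature=1.0, noise_scale=1.0):
    """Relaxed sample from a categorical distribution via the Gumbel-softmax trick.

    `noise_scale` multiplies the Gumbel perturbations; increasing it during training
    is the first strategy described in the abstract (names are illustrative).
    """
    gumbel = -torch.log(-torch.log(torch.rand_like(logits) + 1e-20) + 1e-20)
    return F.softmax((logits + noise_scale * gumbel) / temperature, dim=-1)

# Toy usage: two sequential discrete components on one execution path.
logits_a = torch.randn(8, 4, requires_grad=True)   # first discrete choice
logits_b = torch.randn(4, 4, requires_grad=True)   # second choice, conditioned on the first

z_a = gumbel_softmax_sample(logits_a, temperature=0.5, noise_scale=2.0)
z_b = gumbel_softmax_sample(z_a @ logits_b, temperature=0.5, noise_scale=2.0)
z_b.sum().backward()   # gradients flow through both relaxed discrete components
```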


Learning Disentangled Discrete Representations

arXiv.org Artificial Intelligence

Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) with a tailored categorical variational autoencoder. We show that the underlying grid structure of categorical distributions mitigates the problem of rotational invariance associated with multivariate Gaussian distributions, acting as an efficient inductive prior for disentangled representations. We provide both analytical and empirical findings that demonstrate the advantages of discrete VAEs for learning disentangled representations. Furthermore, we introduce the first unsupervised model selection strategy that favors disentangled representations.
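A minimal sketch of the idea, assuming a Gumbel-softmax relaxation for the categorical posterior (a common way to train categorical VAEs; the paper's exact parameterization may differ): the latent consists of several categorical variables decoded through a fixed one-dimensional grid of values, rather than a multivariate Gaussian. The class and argument names below are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CategoricalLatent(nn.Module):
    """Discrete latent layer: `n_vars` categorical variables with `n_cats` categories each.

    Each variable is mapped onto a fixed 1-D grid of codebook values, giving the latent
    space the grid structure discussed in the abstract (sketch only).
    """
    def __init__(self, in_dim, n_vars=10, n_cats=32):
        super().__init__()
        self.n_vars, self.n_cats = n_vars, n_cats
        self.to_logits = nn.Linear(in_dim, n_vars * n_cats)
        # Fixed grid of codebook values in [-1, 1], one scalar per category.
        self.register_buffer("grid", torch.linspace(-1.0, 1.0, n_cats))

    def forward(self, h, temperature=0.5):
        logits = self.to_logits(h).view(-1, self.n_vars, self.n_cats)
        probs = F.gumbel_softmax(logits, tau=temperature, dim=-1)  # relaxed one-hot samples
        z = probs @ self.grid        # project each variable onto the scalar grid
        return z, logits             # z: (batch, n_vars); logits feed the KL term
```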


Smooth Variational Graph Embeddings for Efficient Neural Architecture Search

arXiv.org Artificial Intelligence

This leads to the desire for an accurate search space encoding that enables performance prediction via surrogates and black-box optimization to find high-performing architectures in a continuous search space [67]. Zhang et al. [67] propose D-VAE, a graph neural network (GNN) [14, 23, 56] based variational neural architecture embedding with an emphasis on the information flow, and thereby achieve good results in architecture performance prediction and BO on the ENAS search space [39] and on a dataset of Bayesian Networks. In this paper, we propose an approach to neural architecture search (NAS) based on graph embeddings. NAS has been addressed previously using discrete, sampling-based methods, which are computationally expensive, as well as differentiable approaches, which come at lower costs but enforce stronger constraints on the search space. The proposed approach leverages advantages from both sides by building a smooth variational neural architecture embedding.
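As a rough picture of what a variational graph embedding for architectures involves, the sketch below encodes an architecture graph with plain adjacency-matrix message passing into a graph-level mean and variance, from which a smooth latent code is sampled. The two-layer design, pooling choice, and all names are illustrative assumptions, not the paper's model.

```python
import torch
import torch.nn as nn

class GraphVariationalEncoder(nn.Module):
    """Sketch: encode an architecture graph into a smooth variational latent embedding."""
    def __init__(self, node_dim, hidden_dim=64, latent_dim=16):
        super().__init__()
        self.msg1 = nn.Linear(node_dim, hidden_dim)
        self.msg2 = nn.Linear(hidden_dim, hidden_dim)
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)

    def forward(self, x, adj):
        # Two rounds of neighborhood aggregation (A @ H), then a graph-level readout.
        h = torch.relu(adj @ self.msg1(x))
        h = torch.relu(adj @ self.msg2(h))
        g = h.mean(dim=0)                      # mean-pool over nodes
        mu, logvar = self.to_mu(g), self.to_logvar(g)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        return z, mu, logvar

# Toy usage: a 5-node architecture graph with 8-dimensional node (operation) features.
x = torch.randn(5, 8)
adj = (torch.rand(5, 5) > 0.5).float()
z, mu, logvar = GraphVariationalEncoder(node_dim=8)(x, adj)
```

A smooth latent space of this form is what makes surrogate-based performance prediction and black-box optimization over architectures practical, since nearby codes correspond to similar graphs.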