PhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks

Yax, Nicolas, Oudeyer, Pierre-Yves, Palminteri, Stefano

Jun-16-2024–arXiv.org Artificial Intelligence

This paper introduces PhyloLM, a method adapting phylogenetic algorithms to Large Language Models (LLMs) to explore whether and how they relate to each other and to predict their performance characteristics. Our method calculates a phylogenetic distance metrics based on the similarity of LLMs' output. The resulting metric is then used to construct dendrograms, which satisfactorily capture known relationships across a set of 111 open-source and 45 closed models. Furthermore, our phylogenetic distance predicts performance in standard benchmarks, thus demonstrating its functional validity and paving the way for a time and cost-effective estimation of LLM capabilities. To sum up, by translating population genetic concepts to machine learning, we propose and validate a tool to evaluate LLM development, relationships and capabilities, even in the absence of transparent training information.

distance matrix, matrix, similarity matrix, (16 more...)

arXiv.org Artificial Intelligence

Jun-16-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Santa Clara County > Palo Alto (0.04)
- Europe > France
  - Île-de-France > Paris
    - Paris (0.04)
  - Nouvelle-Aquitaine > Gironde
    - Bordeaux (0.04)
- Asia > Middle East
  - Iran > Tehran Province > Tehran (0.04)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.99)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found