AITopics | model discovery

Collaborating Authors

model discovery

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Automated Model Discovery via Multi-modal & Multi-step Pipeline

Neural Information Processing SystemsJun-22-2026, 23:07:08 GMT

Automated model discovery is the process of automatically searching and identifying the most appropriate model for a given dataset over a large combinatorial search space.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

Automated Model Discovery via Multi-modal & Multi-step Pipeline

Neural Information Processing SystemsJun-14-2026, 05:56:58 GMT

Automated model discovery is the process of automatically searching and identifying the most appropriate model for a given dataset over a large combinatorial search space.

artificial intelligence, name change, proceedings, (4 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.38)

Technology: Information Technology > Artificial Intelligence (0.81)

Add feedback

Automated Model Discovery via Multi-modal & Multi-step Pipeline

Jung-Mok, Lee, Hyeon-Woo, Nam, Ye-Bin, Moon, Nam, Junhyun, Oh, Tae-Hyun

arXiv.org Artificial IntelligenceOct-1-2025

Automated model discovery is the process of automatically searching and identifying the most appropriate model for a given dataset over a large combinatorial search space. Existing approaches, however, often face challenges in balancing the capture of fine-grained details with ensuring generalizability beyond training data regimes with a reasonable model complexity. In this paper, we present a multi-modal \& multi-step pipeline for effective automated model discovery. Our approach leverages two vision-language-based modules (VLM), AnalyzerVLM and EvaluatorVLM, for effective model proposal and evaluation in an agentic way. AnalyzerVLM autonomously plans and executes multi-step analyses to propose effective candidate models. EvaluatorVLM assesses the candidate models both quantitatively and perceptually, regarding the fitness for local details and the generalibility for overall trends. Our results demonstrate that our pipeline effectively discovers models that capture fine details and ensure strong generalizability. Additionally, extensive ablation studies show that both multi-modality and multi-step reasoning play crucial roles in discovering favorable models.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.25946

Country: North America (0.28)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.97)

Add feedback

To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions

Emmanoulopoulos, Dimitrios, Olby, Ollie, Lyon, Justin, Stillman, Namid R.

arXiv.org Artificial IntelligenceJul-14-2025

Applications range from technical analysis of a company's fundamental value, wider market sentiment, factor analysis and most tasks involving some form of natural language processing (NLP) [1, 2]. The implications to trading systems will likely be a dramatic increase in the rate and volume of market insights that can be generated to inform decisions. The overall capabilities of LLMs have dramatically increased over the last five years [3]. This has led to an increase in the number of LLMs available, both as proprietary models from frontier labs or as smaller models with open-weights which can be run locally. Given this, the influence of LLMs on trading decisions is expected to be varied and highly model specific. Early work is starting to compare and benchmark these models in tasks specific to financial applications, such as trading decisions, portfolio optimisation, and market analysis [4-10]. As the number of models increases, and their underlying strengths and weaknesses become more apparent, it is expected that different classes of pre-trained models will be more regularly deployed to achieve certain objectives [11, 12]. While these objectives are likely to be significantly linked to NLP-based tasks, such as text summarisation, analysis, and generation, recent LLM architectures give early evidence that more complex tasks can also be automated. These LLMs, such as the'o' series from OpenAI or'R1' from DeepSeek, generate'reasoning' tokens which result in the model performing more in-context analysis of the generated output and has lead to improved performance over a number of key evaluation measures [13, 14].

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2507.08584

Country: North America > United States (0.93)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Banking & Finance > Trading (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Model Discovery with Grammatical Evolution. An Experiment with Prime Numbers

Skrzyński, Jakub, Sepioło, Dominik, Ligęza, Antoni

arXiv.org Artificial IntelligenceMay-20-2025

Machine Learning produces efficient decision and prediction models based on input-output data only. Such models have the form of decision trees or neural nets and are far from transparent analytical models, based on mathematical formulas. Analytical model discovery requires additional knowledge and may be performed with Grammatical Evolution. Such models are transparent, concise, and have readable components and structure. This paper reports on a non-trivial experiment with generating such models.

evolutionary algorithm, experiment, machine learning, (11 more...)

arXiv.org Artificial Intelligence

2505.1244

Country:

Europe > Poland > Lesser Poland Province > Kraków (0.06)
Europe > Switzerland (0.05)
Europe > Poland > Łódź Province > Łódź (0.05)
Europe > Austria > Styria > Graz (0.05)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.35)

Add feedback

Discovering dynamical laws for speech gestures

Kirkham, Sam

arXiv.org Artificial IntelligenceMay-6-2025

A fundamental challenge in the cognitive sciences is discovering the dynamics that govern behaviour. Take the example of spoken language, which is characterised by a highly variable and complex set of physical movements that map onto the small set of cognitive units that comprise language. What are the fundamental dynamical principles behind the movements that structure speech production? In this study, we discover models in the form of symbolic equations that govern articulatory gestures during speech. A sparse symbolic regression algorithm is used to discover models from kinematic data on the tongue and lips. We explore these candidate models using analytical techniques and numerical simulations, and find that a second-order linear model achieves high levels of accuracy, but a nonlinear force is required to properly model articulatory dynamics in approximately one third of cases. This supports the proposal that an autonomous, nonlinear, second-order differential equation is a viable dynamical law for articulatory gestures in speech. We conclude by identifying future opportunities and obstacles in data-driven model discovery and outline prospects for discovering the dynamical principles that govern language, brain and behaviour.

artificial intelligence, machine learning, trajectory, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1111/cogs.70064

2504.04849

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.87)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
(2 more...)

Add feedback

BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery

Gandhi, Kanishk, Li, Michael Y., Goodyear, Lyle, Li, Louise, Bhaskar, Aditi, Zaman, Mohammed, Goodman, Noah D.

arXiv.org Artificial IntelligenceJan-2-2025

Understanding the world and explaining it with scientific theories is a central aspiration of artificial intelligence research. Proposing theories, designing experiments to test them, and then revising them based on data are fundamental to scientific discovery. Despite the significant promise of LLM-based scientific agents, no benchmarks systematically test LLM's ability to propose scientific models, collect experimental data, and revise them in light of new data. We introduce BoxingGym, a benchmark with 10 environments for systematically evaluating both experimental design (e.g. collecting data to test a scientific theory) and model discovery (e.g. proposing and revising scientific theories). To enable tractable and quantitative evaluation, we implement each environment as a generative probabilistic model with which a scientific agent can run interactive experiments. These probabilistic models are drawn from various real-world scientific domains ranging from psychology to ecology. To quantitatively evaluate a scientific agent's ability to collect informative experimental data, we compute the expected information gain (EIG), an information-theoretic quantity which measures how much an experiment reduces uncertainty about the parameters of a generative model. A good scientific theory is a concise and predictive explanation. Therefore, to quantitatively evaluate model discovery, we ask a scientific agent to explain their model and then assess whether this explanation enables another scientific agent to make reliable predictions about this environment. In addition to this explanation-based evaluation, we compute standard model evaluation metrics such as prediction errors. We find that current LLMs, such as GPT-4o, struggle with both experimental design and model discovery. We find that augmenting the LLM-based agent with an explicit statistical model does not reliably improve these results.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2501.0154

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

On sparse regression, Lp-regularization, and automated model discovery

McCulloch, Jeremy A., Pierre, Skyler R. St., Linka, Kevin, Kuhl, Ellen

arXiv.org Artificial IntelligenceJan-15-2024

Sparse regression and feature extraction are the cornerstones of knowledge discovery from massive data. Their goal is to discover interpretable and predictive models that provide simple relationships among scientific variables. While the statistical tools for model discovery are well established in the context of linear regression, their generalization to nonlinear regression in material modeling is highly problem-specific and insufficiently understood. Here we explore the potential of neural networks for automatic model discovery and induce sparsity by a hybrid approach that combines two strategies: regularization and physical constraints. We integrate the concept of Lp regularization for subset selection with constitutive neural networks that leverage our domain knowledge in kinematics and thermodynamics. We train our networks with both, synthetic and real data, and perform several thousand discovery runs to infer common guidelines and trends: L2 regularization or ridge regression is unsuitable for model discovery; L1 regularization or lasso promotes sparsity, but induces strong bias that may aggressively change the results; only L0 regularization allows us to transparently fine-tune the trade-off between interpretability and predictability, simplicity and accuracy, and bias and variance. With these insights, we demonstrate that Lp regularized constitutive neural networks can simultaneously discover both, interpretable models and physically meaningful parameters. We anticipate that our findings will generalize to alternative discovery techniques such as sparse and symbolic regression, and to other domains such as biology, chemistry, or medicine. Our ability to automatically discover material models from data could have tremendous applications in generative material design and open new opportunities to manipulate matter, alter properties of existing materials, and discover new materials with user-defined properties.

artificial intelligence, machine learning, regularization, (17 more...)

arXiv.org Artificial Intelligence

2310.06872

Country:

North America > United States > California (0.14)
Europe > Germany (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Energy > Oil & Gas > Upstream (0.94)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Automatically identifying ordinary differential equations from data

Egan, Kevin, Li, Weizhen, Carvalho, Rui

arXiv.org Artificial IntelligenceMay-3-2023

Discovering nonlinear differential equations that describe system dynamics from empirical data is a fundamental challenge in contemporary science. Here, we propose a methodology to identify dynamical laws by integrating denoising techniques to smooth the signal, sparse regression to identify the relevant parameters, and bootstrap confidence intervals to quantify the uncertainty of the estimates. We evaluate our method on well-known ordinary differential equations with an ensemble of random initial conditions, time series of increasing length, and varying signal-to-noise ratios. Our algorithm consistently identifies three-dimensional systems, given moderately-sized time series and high levels of signal quality relative to background noise. By accurately discovering dynamical systems automatically, our methodology has the potential to impact the understanding of complex systems, especially in fields where data are abundant, but developing mathematical models demands considerable effort.

artificial intelligence, machine learning, snr, (16 more...)

arXiv.org Artificial Intelligence

2304.11182

Country:

Europe > United Kingdom > England > Durham > Durham (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Convergence of uncertainty estimates in Ensemble and Bayesian sparse model discovery

Gao, L. Mars, Fasel, Urban, Brunton, Steven L., Kutz, J. Nathan

arXiv.org Artificial IntelligenceApr-26-2023

Sparse model identification enables nonlinear dynamical system discovery from data. However, the control of false discoveries for sparse model identification is challenging, especially in the low-data and high-noise limit. In this paper, we perform a theoretical study on ensemble sparse model discovery, which shows empirical success in terms of accuracy and robustness to noise. In particular, we analyse the bootstrapping-based sequential thresholding least-squares estimator. We show that this bootstrapping-based ensembling technique can perform a provably correct variable selection procedure with an exponential convergence rate of the error rate. In addition, we show that the ensemble sparse model discovery method can perform computationally efficient uncertainty estimation, compared to expensive Bayesian uncertainty quantification methods via MCMC. We demonstrate the convergence properties and connection to uncertainty quantification in various numerical studies on synthetic sparse linear regression and sparse model discovery. The experiments on sparse linear regression support that the bootstrapping-based sequential thresholding least-squares method has better performance for sparse variable selection compared to LASSO, thresholding least-squares, and bootstrapping-based LASSO. In the sparse model discovery experiment, we show that the bootstrapping-based sequential thresholding least-squares method can provide valid uncertainty quantification, converging to a delta measure centered around the true value with increased sample sizes. Finally, we highlight the improved robustness to hyperparameter selection under shifting noise and sparsity levels of the bootstrapping-based sequential thresholding least-squares method compared to other sparse regression methods.

artificial intelligence, machine learning, probability, (18 more...)

arXiv.org Artificial Intelligence

2301.12649

Country:

North America > United States > Washington > King County > Seattle (0.14)
Europe > Latvia > Riga Municipality > Riga (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback