AITopics | selection criteria

Collaborating Authors

selection criteria

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

StructuralKernelSearchviaBayesianOptimization andSymbolicalOptimalTransport

Neural Information Processing SystemsFeb-13-2026, 05:00:27 GMT

Despite recent advances in automated machine learning, model selection is still acomplexandcomputationally intensiveprocess.

artificial intelligence, kernel, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)
Europe > Germany (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

98c56bce74669e2e4e7a9fc1caa8c326-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-13-2026, 02:26:17 GMT

algorithm, nn-graph, triangle inequality, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.53)

Add feedback

fb60d411a5c5b72b2e7d3527cfc84fd0-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 05:31:41 GMT

contribution, dataset, final version, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

We thank the reviewer for taking the time to review our submission and for their helpful

Neural Information Processing SystemsOct-3-2025, 06:56:09 GMT

We will elaborate on works relevant to the noiseless triplet setting in the related work section. Thank you also for pointing out the typos. That is an interesting way of looking at the problem. If the dataset consists of hierarchical clusters as in the condition for Theorem 4.6, This is made more explicit in the discussion following Theorem 4.4. Houle, Michael E., and Michael Nett (2013), Rank cover trees for nearest neighbor search, International Conference on

algorithm, nn-graph, triangle inequality, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)

Add feedback

Agents of Discovery

Diefenbacher, Sascha, Hallin, Anna, Kasieczka, Gregor, Krämer, Michael, Lauscher, Anne, Lukas, Tim

arXiv.org Artificial IntelligenceSep-11-2025

The substantial data volumes encountered in modern particle physics and other domains of fundamental physics research allow (and require) the use of increasingly complex data analysis tools and workflows. While the use of machine learning (ML) tools for data analysis has recently proliferated, these tools are typically special-purpose algorithms that rely, for example, on encoded physics knowledge to reach optimal performance. In this work, we investigate a new and orthogonal direction: Using recent progress in large language models (LLMs) to create a team of agents -- instances of LLMs with specific subtasks -- that jointly solve data analysis-based research problems in a way similar to how a human researcher might: by creating code to operate standard tools and libraries (including ML systems) and by building on results of previous iterations. If successful, such agent-based systems could be deployed to automate routine analysis components to counteract the increasing complexity of modern tool chains. To investigate the capabilities of current-generation commercial LLMs, we consider the task of anomaly detection via the publicly available and highly-studied LHC Olympics dataset. Several current models by OpenAI (GPT-4o, o4-mini, GPT-4.1, and GPT-5) are investigated and their stability tested. Overall, we observe the capacity of the agent-based system to solve this data analysis problem. The best agent-created solutions mirror the performance of human state-of-the-art results.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2509.08535

Country:

Europe > Austria > Vienna (0.14)
Europe > Germany > Hamburg (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Structural Kernel Search via Bayesian Optimization and Symbolical Optimal Transport

Neural Information Processing SystemsAug-19-2025, 22:29:46 GMT

We propose a novel, efficient search method through a general, structured kernel space.

artificial intelligence, kernel, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.49)

Add feedback

fb60d411a5c5b72b2e7d3527cfc84fd0-AuthorFeedback.pdf

Neural Information Processing SystemsAug-17-2025, 09:34:13 GMT

We thank the reviewers for their time and valuable feedback. We will clearly mention this contribution in our final version. Our design principle is in contrast to the OpenML-CC18 and the TU datasets [43] Regarding the package description, it is provided with example code snippets in Appendices E.1 and E.2, due to space We thank R4 for pointing out the issue with PRC-AUC. Average Precision (AP), and observed the same trend (see Table above). We will update this in the final version.

contribution, dataset, final version, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

5c04925674920eb58467fb52ce4ef728-Paper.pdf

Neural Information Processing SystemsAug-14-2025, 17:06:59 GMT

few-shot learning, learning, selection, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(2 more...)

Genre: Research Report > New Finding (0.94)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)

Add feedback

D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning

Zhang, Jia, Zhang, Chen-Xi, Liu, Yao, Jin, Yi-Xuan, Yang, Xiao-Wen, Zheng, Bo, Liu, Yi, Guo, Lan-Zhe

arXiv.org Artificial IntelligenceMar-14-2025

Recent advancements in instruction tuning for large language models (LLMs) suggest that a small, high-quality dataset can significantly equip LLMs with instruction-following capabilities, outperforming large datasets often burdened by quality and redundancy issues. However, the challenge lies in automatically identifying valuable subsets from large datasets to boost both the effectiveness and efficiency of instruction tuning. In this paper, we first establish data selection criteria based on three distinct aspects of data value: diversity, difficulty, and dependability, and then propose the D3 method comprising two key steps of scoring and selection. Specifically, in the scoring step, we define the diversity function to measure sample distinctiveness and introduce the uncertainty-based prediction difficulty to evaluate sample difficulty by mitigating the interference of context-oriented generation diversity. Additionally, we integrate an external LLM for dependability assessment. In the selection step, we formulate the D3 weighted coreset objective, which jointly optimizes three aspects of data value to solve for the most valuable subset. The two steps of D3 can iterate multiple rounds, incorporating feedback to refine the selection focus adaptively. Experiments on three datasets demonstrate the effectiveness of D3 in endowing LLMs with competitive or even superior instruction-following capabilities using less than 10% of the entire dataset.

dataset, diversity, instruction, (17 more...)

arXiv.org Artificial Intelligence

2503.11441

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre:

Research Report (1.00)
Overview (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

CRPS-Based Targeted Sequential Design with Application in Chemical Space

Friedli, Lea, Gautier, Athénaïs, Broccard, Anna, Ginsbourger, David

arXiv.org Machine LearningMar-14-2025

Sequential design of real and computer experiments via Gaussian Process (GP) models has proven useful for parsimonious, goal-oriented data acquisition purposes. In this work, we focus on acquisition strategies for a GP model that needs to be accurate within a predefined range of the response of interest. Such an approach is useful in various fields including synthetic chemistry, where finding molecules with particular properties is essential for developing useful materials and effective medications. GP modeling and sequential design of experiments have been successfully applied to a plethora of domains, including molecule research. Our main contribution here is to use the threshold-weighted Continuous Ranked Probability Score (CRPS) as a basic building block for acquisition functions employed within sequential design. We study pointwise and integral criteria relying on two different weighting measures and benchmark them against competitors, demonstrating improved performance with respect to considered goals. The resulting acquisition strategies are applicable to a wide range of fields and pave the way to further developing sequential design relying on scoring rules.

criteria, excursion, kernel, (17 more...)

arXiv.org Machine Learning

2503.1125

Country: