Towards Explainable Personalized Recommendations by Learning from Users' Photos

Díez, Jorge, Pérez-Núñez, Pablo, Luaces, Oscar, Remeseiro, Beatriz, Bahamonde, Antonio

arXiv.org Artificial Intelligence

Explaining the output of a complex system, such as a Recommender System (RS), is becoming of utmost importance for both users and companies. In this paper we explore the idea that personalized explanations can be learned as recommendations themselves. There are plenty of online services where users can upload photos, in addition to rating items. We assume that users take these photos to reinforce or justify their opinions about the items. For this reason we try to predict which photo a user would take of an item, because that image is the argument that can best convince her of the qualities of the item. In this sense, an RS can explain its results and, therefore, increase its reliability. Furthermore, once we have a model to predict attractive images for users, we can estimate their distribution. The paper includes a formal framework that estimates the authorship probability for a given pair (user, photo). To illustrate the proposal, we use data gathered from TripAdvisor containing the reviews (with photos) of restaurants in six cities of different sizes. Keywords: Recommender Systems, Personalization, Explainability, Photo, Collaborative

1. Introduction

Explainable Artificial Intelligence (XAI) is becoming an important area of interest, since explainability is increasingly necessary to meet stakeholder demands. In particular, the General Data Protection Regulation (GDPR) [29] of the European Union demands transparency in systems that take decisions affecting people, making explanations more needed than ever. Additionally, explanations may help increase the trust of users in AI algorithms, since people rely not only on their efficacy but also on the degree to which they understand the process these algorithms follow. Recommender Systems are a case in point: since they provide suggestions to users, explainability plays an important role in them.
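The abstract mentions a framework that estimates the authorship probability for a (user, photo) pair. As a rough, hypothetical illustration (not the paper's actual model), one way to turn user-photo affinity scores into such a probability is to softmax-normalize a dot-product score over a candidate photo set; all vectors and names below are invented for the sketch:

```python
import math

def authorship_probability(user_vec, photo_vec, other_photo_vecs):
    """Toy estimate of P(author | user, photo): the photo's affinity score
    for this user, softmax-normalized over a set of candidate photos.
    Purely illustrative; not the paper's formal framework."""
    def score(u, p):
        return sum(ui * pi for ui, pi in zip(u, p))  # dot-product affinity
    scores = [score(user_vec, photo_vec)] + [score(user_vec, q) for q in other_photo_vecs]
    exps = [math.exp(s) for s in scores]
    return exps[0] / sum(exps)

# A user whose (made-up) taste vector aligns with the first photo
user = [1.0, 0.0]
photo = [0.9, 0.1]
others = [[-0.5, 0.8], [0.0, -1.0]]
p = authorship_probability(user, photo, others)  # well above chance (1/3)
```

The aligned photo ends up with the largest share of the probability mass, which is the behavior one would want before ranking candidate explanation images.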


Scaling Physical Reasoning with the PHYSICS Dataset

Zheng, Shenghe, Cheng, Qianjia, Yao, Junchi, Wu, Mengsong, He, Haonan, Ding, Ning, Cheng, Yu, Hu, Shuyue, Bai, Lei, Zhou, Dongzhan, Cui, Ganqu, Ye, Peng

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have achieved remarkable progress on advanced reasoning tasks such as mathematics and coding competitions. Meanwhile, physics, despite being both reasoning-intensive and essential to real-world understanding, has received limited academic and industrial attention. This paper introduces PHYSICS, a dataset containing 16,568 high-quality physics problems spanning subjects and difficulty levels, to address this gap. Specifically, PHYSICS is curated with exercises from over 100 textbooks through a carefully designed pipeline for quality control. It covers five major physics domains: Mechanics, Electromagnetism, Thermodynamics, Optics, and Modern Physics. It also spans a wide range of difficulty levels, from high school to graduate-level physics courses. To utilize the data for improving and evaluating the model's physical reasoning capabilities, we split the dataset into training and test sets, and provide reasoning paths generated by powerful reasoning models for the training data to facilitate model training. In addition, for the evaluation part, we find that existing evaluation frameworks exhibit biases in aspects such as units, simplification, and precision in the physics domain. To balance efficiency and accuracy, we introduce a Rule+Model evaluation framework tailored to physics problems. Our evaluations on current state-of-the-art open-source and proprietary models highlight the limitations of current models in handling physics-related tasks. We hope that our dataset and evaluation methodology will jointly advance the development of LLMs in the field of physics. The code and data can be found at: https://github.com/Zhengsh123/PHYSICS.
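To make the Rule+Model idea concrete, here is a minimal sketch (my own illustration, not the PHYSICS implementation): a cheap rule stage normalizes a few units and compares numbers within a tolerance, and only answers it cannot parse are deferred to a model judge. The unit table and tolerance are invented for the example:

```python
def rule_check(pred, gold, rel_tol=1e-2):
    """Rule stage: compare numeric answers after stripping a few common units.
    Returns True/False, or None when the rule cannot decide.
    Unit table is illustrative only; longest suffixes listed first."""
    units = {"km/h": 1 / 3.6, "m/s": 1.0, "kJ": 1e3, "J": 1.0}
    def parse(ans):
        for u, factor in units.items():
            if ans.endswith(u):
                try:
                    return float(ans[:-len(u)].strip()) * factor
                except ValueError:
                    return None
        try:
            return float(ans)
        except ValueError:
            return None
    p, g = parse(pred), parse(gold)
    if p is None or g is None:
        return None  # rule cannot decide
    return abs(p - g) <= rel_tol * max(abs(g), 1e-12)

def evaluate(pred, gold, model_judge):
    """Rule first; fall back to a (stand-in) model judge when undecided."""
    verdict = rule_check(pred, gold)
    return model_judge(pred, gold) if verdict is None else verdict

# "36 km/h" and "10 m/s" agree once units are normalized,
# so the (dummy) model judge is never consulted
assert evaluate("36 km/h", "10 m/s", lambda p, g: False)
```

The design point is that unit and precision mismatches, which the abstract identifies as a bias in existing frameworks, are handled deterministically, keeping the expensive model stage for genuinely ambiguous answers.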


PRESOL: a web-based computational setting for feature-based flare forecasting

Curletto, Chiara, Massa, Paolo, Tagliafico, Valeria, Campi, Cristina, Benvenuto, Federico, Piana, Michele, Tacchino, Andrea

arXiv.org Artificial Intelligence

Solar flares are the most explosive phenomena in the solar system and the main trigger of the chain of events that starts with Coronal Mass Ejections and leads to geomagnetic storms with possible impacts on infrastructures at Earth. Data-driven solar flare forecasting relies on either deep learning approaches, which are operationally promising but have a low degree of explainability, or machine learning algorithms, which can provide information on the physical descriptors that most impact the prediction. This paper describes a web-based technological platform for the execution of a computational pipeline of feature-based machine learning methods that provide predictions of flare occurrence, feature ranking information, and assessment of the prediction performances.
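As a toy illustration of the "feature ranking" output such a pipeline can expose (not the PRESOL method; feature names and the correlation-based ranking are my own simplification), one can rank physical descriptors by the absolute correlation of each feature with the flare label:

```python
import math

def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def rank_features(features, labels):
    """Rank descriptors by |correlation| with flare occurrence (toy proxy
    for the richer rankings a feature-based pipeline can produce)."""
    scored = {name: abs(pearson(vals, labels)) for name, vals in features.items()}
    return sorted(scored, key=scored.get, reverse=True)

# Invented descriptor values for four active regions (1 = flare occurred)
toy = {"total_flux": [1, 2, 3, 4], "helicity": [4, 1, 3, 2]}
order = rank_features(toy, [0, 0, 1, 1])  # "total_flux" ranks first here
```

Real pipelines use model-derived importances rather than raw correlations, but the interface, an ordered list of physical descriptors, is the same.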


Parametrized Quantum Circuit Learning for Quantum Chemical Applications

Jones, Grier M., Prasad, Viki Kumar, Fekl, Ulrich, Jacobsen, Hans-Arno

arXiv.org Artificial Intelligence

Despite numerous proposed applications, there remains limited exploration of datasets relevant to quantum chemistry. In this study, we investigate the potential benefits and limitations of parametrized quantum circuits (PQCs) on two chemically meaningful datasets: (1) the BSE49 dataset, containing bond separation energies for 49 different classes of chemical bonds, and (2) a dataset of water conformations, where coupled-cluster singles and doubles (CCSD) wavefunctions are predicted from lower-level electronic structure methods using the data-driven coupled-cluster (DDCC) approach. We construct a comprehensive set of 168 PQCs by combining 14 data encoding strategies with 12 variational ansätze, and evaluate their performance on circuits with 5 and 16 qubits. Our initial analysis examines the impact of circuit structure on model performance using state-vector simulations. We then explore how circuit depth and training set size influence model performance. Finally, we assess the performance of the best-performing PQCs on current quantum hardware, using both noisy simulations ("fake" backends) and real quantum devices. Our findings underscore the challenges of applying PQCs to chemically relevant problems that are straightforward for classical machine learning methods but remain non-trivial for quantum approaches.

1 Introduction

In recent years, machine learning (ML) has emerged as a popular tool in chemistry to reveal new patterns in data, provide new insights beyond simple models, accelerate computations, and analyze chemical space. For computational chemists, the primary goal of applying ML is often to circumvent the explicit calculation of molecular properties, which can be computationally expensive for large datasets.
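The abstract's core object, a data-encoding layer followed by a variational ansatz evaluated by state-vector simulation, can be sketched at its smallest possible scale. The single-qubit, single-parameter circuit below is my own minimal illustration (real PQCs in the study use 5 and 16 qubits and far richer encodings and ansätze):

```python
import math

def ry(theta, state):
    """Apply a single-qubit Ry rotation to a real-amplitude state (a0, a1)."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    a0, a1 = state
    return (c * a0 - s * a1, s * a0 + c * a1)

def pqc_expectation(x, theta):
    """Angle-encode feature x, apply a one-parameter ansatz, return <Z>.
    A toy stand-in for the encoding + ansatz circuits in the study."""
    state = (1.0, 0.0)        # |0>
    state = ry(x, state)      # data-encoding layer
    state = ry(theta, state)  # variational ansatz layer
    a0, a1 = state
    return a0 ** 2 - a1 ** 2  # expectation value of Pauli-Z

# With theta = -x the ansatz exactly undoes the encoding, so <Z> returns to 1
val = pqc_expectation(0.7, -0.7)
```

Training such a model means adjusting theta (here, one angle; in practice, many) so that the measured expectation values fit the target chemical property, e.g. a bond separation energy.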


Driving Accurate Allergen Prediction with Protein Language Models and Generalization-Focused Evaluation

Wong, Brian Shing-Hei, Kim, Joshua Mincheol, Fung, Sin-Hang, Xiong, Qing, Ao, Kelvin Fu-Kiu, Wei, Junkang, Wang, Ran, Wang, Dan Michelle, Zhou, Jingying, Feng, Bo, Cheng, Alfred Sze-Lok, Yip, Kevin Y., Tsui, Stephen Kwok-Wing, Cao, Qin

arXiv.org Artificial Intelligence

Allergens, typically proteins capable of triggering adverse immune responses, represent a significant public health challenge. To accurately identify allergen proteins, we introduce Applm (Allergen Prediction with Protein Language Models), a computational framework that leverages the 100-billion parameter xTrimoPGLM protein language model. We show that Applm consistently outperforms seven state-of-the-art methods in a diverse set of tasks that closely resemble difficult real-world scenarios. These include identifying novel allergens that lack similar examples in the training set, differentiating between allergens and non-allergens among homologs with high sequence similarity, and assessing functional consequences of mutations that create few changes to the protein sequences. Our analysis confirms that xTrimoPGLM, originally trained on one trillion tokens to capture general protein sequence characteristics, is crucial for Applm's performance by detecting important differences among protein sequences. In addition to providing Applm as open-source software, we also provide our carefully curated benchmark datasets to facilitate future research.
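One of the evaluation settings described above, separating allergens from non-allergens among homologs with high sequence similarity, depends on measuring how similar two protein sequences are. The sketch below is a deliberately crude illustration of that idea (ungapped identity; real benchmarks use proper alignment tools), with invented sequences:

```python
def percent_identity(seq_a, seq_b):
    """Crude ungapped percent identity between two protein sequences.
    Real pipelines would use an aligner; this is illustrative only."""
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / max(len(seq_a), len(seq_b))

def homolog_pairs(allergens, non_allergens, threshold=80.0):
    """Collect (allergen, non-allergen) pairs above an identity threshold --
    the hard, look-alike cases a benchmark like this isolates."""
    return [(a, n) for a in allergens for n in non_allergens
            if percent_identity(a, n) >= threshold]

# Toy sequences: one residue apart, so they form a hard homolog pair
pairs = homolog_pairs(["MKTA"], ["MKTG"], threshold=70.0)
```

The point of such pairs is that sequence similarity alone cannot separate the classes, which is where a large protein language model's learned representations are expected to help.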


Non-native Children's Automatic Speech Assessment Challenge (NOCASA)

Getman, Yaroslav, Grósz, Tamás, Kurimo, Mikko, Salvi, Giampiero

arXiv.org Artificial Intelligence

This paper presents the "Non-native Children's Automatic Speech Assessment" (NOCASA) challenge, a data competition held as part of the IEEE MLSP 2025 conference. NOCASA challenges participants to develop new systems that can assess single-word pronunciations of young second language (L2) learners as part of a gamified pronunciation training app. To achieve this, several issues must be addressed, most notably the limited nature of available training data and the highly unbalanced distribution among the pronunciation level categories. To expedite development, we provide pseudo-anonymized training data (TeflonNorL2) containing 10,334 recordings from 44 speakers attempting to pronounce 205 distinct Norwegian words, human-rated on a 1 to 5 scale (the number of stars that should be given in the game). In addition to the data, two already trained systems are released as official baselines: an SVM classifier trained on the ComParE_16 acoustic feature set and a multi-task wav2vec 2.0 model. The latter achieves the best performance on the challenge test set, with an unweighted average recall (UAR) of 36.37%.
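The challenge metric, unweighted average recall, is well suited to the unbalanced star-rating distribution the abstract describes, because it averages per-class recall so that rare pronunciation levels weigh as much as common ones. A small self-contained computation (toy labels, not challenge data):

```python
def unweighted_average_recall(y_true, y_pred):
    """UAR: the mean of per-class recalls, so every class counts equally
    regardless of how many examples it has."""
    classes = sorted(set(y_true))
    recalls = []
    for c in classes:
        idx = [i for i, t in enumerate(y_true) if t == c]
        recalls.append(sum(y_pred[i] == c for i in idx) / len(idx))
    return sum(recalls) / len(recalls)

# Imbalanced toy labels on the 1-5 star scale: a predictor that always
# outputs the majority class gets only 0.5 UAR despite 80% raw accuracy
truth = [1, 1, 1, 1, 5]
preds = [1, 1, 1, 1, 1]
uar = unweighted_average_recall(truth, preds)  # 0.5
```

This is why a degenerate majority-class baseline cannot score well here, and why the wav2vec 2.0 baseline's 36.37% UAR over five classes is meaningfully above the 20% chance level.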


AdaptHetero: Machine Learning Interpretation-Driven Subgroup Adaptation for EHR-Based Clinical Prediction

Liao, Ling, Aagaard, Eva

arXiv.org Machine Learning

However, the intrinsic complexity and heterogeneity of EHR data limit its effectiveness in guiding subgroup-specific modeling. We propose AdaptHetero, a novel MLI-driven framework that transforms interpretability insights into actionable guidance for tailoring model training and evaluation across subpopulations within individual hospital systems. Evaluated on three large-scale EHR datasets -- GOSSIS-1-eICU, WiDS, and MIMIC-IV -- AdaptHetero consistently identifies heterogeneous model behaviors in predicting ICU mortality, in-hospital death, and hidden hypoxemia. By integrating SHAP-based interpretation and unsupervised clustering, the framework enhances the identification of clinically meaningful subgroup-specific characteristics, leading to improved predictive performance and optimized clinical deployment.

Introduction

Machine learning interpretation (MLI) techniques are increasingly leveraged in the analysis of electronic health records (EHRs) to reveal latent clinical patterns and to support trustworthy, actionable decision-making in high-stakes healthcare settings.
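The combination of attribution-based interpretation and unsupervised clustering can be illustrated at its simplest: cluster patients on an interpretation-derived score to surface candidate subgroups. The tiny 1-D k-means below is my own sketch (AdaptHetero's actual pipeline uses SHAP values over many features, not a single invented score):

```python
def kmeans_1d(values, k=2, iters=20):
    """Tiny 1-D k-means (k=2): cluster patients by a single
    interpretation-derived score to expose candidate subgroups.
    Initialized at the min and max of the data."""
    centers = [min(values), max(values)][:k]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for v in values:
            j = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[j].append(v)
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

# Two clearly separated (made-up) attribution regimes: the clustering
# recovers them, suggesting two subpopulations the model treats differently
scores = [0.1, 0.2, 0.15, 2.0, 2.2, 1.9]
centers, groups = kmeans_1d(scores)
```

Once such subgroups are identified, the framework's idea is to tailor training and evaluation per subgroup rather than fitting one model to a heterogeneous population.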


Machine Learning-based Regional Cooling Demand Prediction with Optimised Dataset Partitioning

Zhang, Meng, Li, Zhihui, Yu, Zhibin

arXiv.org Artificial Intelligence

In the context of global warming, even relatively cooler countries like the UK are experiencing a rise in cooling demand, particularly in southern regions such as London. This growing demand, especially during the summer months, presents significant challenges for energy management systems. Accurately predicting cooling demand in urban domestic buildings is essential for maintaining energy efficiency. This study introduces a generalised framework for developing high-resolution Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks using physical model-based summer cooling demand data. To maximise the predictive capability and generalisation ability of the models under limited data scenarios, four distinct data partitioning strategies were implemented: extrapolation, month-based interpolation, global interpolation, and day-based interpolation. Bayesian Optimisation (BO) was then applied to fine-tune the hyper-parameters, substantially improving the framework's predictive accuracy. Results show that the day-based interpolation GRU model demonstrated the best performance due to its ability to retain both the data randomness and the time sequence continuity characteristics. This optimal model achieves a Root Mean Squared Error (RMSE) of 2.22%, a Mean Absolute Error (MAE) of 0.87%, and a coefficient of determination (R²) of 0.9386 on the test set. The generalisation ability of this framework was further evaluated by forecasting.
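Two of the partitioning strategies named above differ in a way that is easy to show in code. The sketch below is my own minimal rendering of the idea (the study's actual splits and ratios may differ): extrapolation holds out the last block of days, forcing the model beyond its training period, while day-based interpolation holds out randomly chosen whole days, interleaving train and test in time but keeping each day's sequence intact:

```python
import random

def extrapolation_split(days, test_frac=0.25):
    """Hold out the final block of days: the model must extrapolate in time."""
    n_test = max(1, int(len(days) * test_frac))
    return days[:-n_test], days[-n_test:]

def day_based_interpolation_split(days, test_frac=0.25, seed=0):
    """Hold out randomly chosen whole days: test days are interleaved with
    training days, but each day's time-sequence continuity is preserved."""
    rng = random.Random(seed)
    n_test = max(1, int(len(days) * test_frac))
    test_idx = set(rng.sample(range(len(days)), n_test))
    train = [d for i, d in enumerate(days) if i not in test_idx]
    test = [d for i, d in enumerate(days) if i in test_idx]
    return train, test

# 30 (made-up) summer days of demand profiles, identified by date
summer = [f"2023-07-{d:02d}" for d in range(1, 31)]
train_ex, test_ex = extrapolation_split(summer)
train_db, test_db = day_based_interpolation_split(summer)
```

The abstract's finding that the day-based interpolation GRU performs best is consistent with this structure: random whole-day holdouts expose the model to the full seasonal range while preserving within-day dynamics.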


Exploring Gender Disparities in Automatic Speech Recognition Technology

ElGhazaly, Hend, Mirheidari, Bahman, Moosavi, Nafise Sadat, Christensen, Heidi

arXiv.org Artificial Intelligence

This study investigates factors influencing Automatic Speech Recognition (ASR) systems' fairness and performance across genders, beyond the conventional examination of demographics. Using the LibriSpeech dataset and the Whisper small model, we analyze how performance varies across different gender representations in training data. Our findings suggest a complex interplay between the gender ratio in training data and ASR performance. Optimal fairness occurs at specific gender distributions rather than a simple 50-50 split. Furthermore, our findings suggest that factors like pitch variability can significantly affect ASR accuracy. This research contributes to a deeper understanding of biases in ASR systems, highlighting the importance of carefully curated training data in mitigating gender bias.
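A fairness analysis like the one described needs two ingredients: a per-utterance error metric (word error rate) and a gap statistic across gender subsets. The sketch below is a generic illustration with invented transcripts, not the study's evaluation code:

```python
def wer(ref, hyp):
    """Word error rate: Levenshtein distance over word tokens,
    divided by the reference length."""
    r, h = ref.split(), hyp.split()
    # DP table; first row/column hold the base insertion/deletion costs
    d = [[i + j if i * j == 0 else 0 for j in range(len(h) + 1)]
         for i in range(len(r) + 1)]
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + (r[i - 1] != h[j - 1]))  # sub
    return d[len(r)][len(h)] / len(r)

def gender_gap(samples):
    """Absolute difference between mean female and mean male WER."""
    by = {"F": [], "M": []}
    for gender, ref, hyp in samples:
        by[gender].append(wer(ref, hyp))
    return abs(sum(by["F"]) / len(by["F"]) - sum(by["M"]) / len(by["M"]))

# Toy evaluation set: one substitution error for the male utterance
data = [("F", "open the door", "open the door"),
        ("M", "close the window", "close a window")]
gap = gender_gap(data)
```

Sweeping such a gap statistic over models trained at different gender ratios is, in essence, how one locates the "optimal fairness" distributions the study reports.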