Combining feature-based approaches with graph neural networks and symbolic regression for synergistic performance and interpretability
Gouvêa, Rogério Almeida, De Breuck, Pierre-Paul, Pretto, Tatiane, Rignanese, Gian-Marco, Santos, Marcos José Leite
To avoid the featurization bottleneck of traditional descriptors, we also leverage GNNs to generate fast, latent-space approximations of MatMiner (ℓ-MM) and Orbital Field Matrix (ℓ-OFM) features. Finally, we augment this feature set with new descriptors derived via symbolic regression. This multifaceted strategy aims to create a more robust, accurate, and versatile featurizer that capitalizes on the distinct strengths of each approach, making it useful for a wider range of dataset sizes. To simplify the generation of all these features, we developed a package named MatterVial (MATerials feaTuRe Extraction Via Interpretable Artificial Learning), which, besides producing all latent-space features from the GNN models, aids in obtaining the interpretable chemical descriptors that correlate with these high-level features. This is achieved through techniques such as SHapley Additive exPlanations (SHAP) analysis on surrogate models and symbolic regression via the Sure Independence Screening and Sparsifying Operator (SISSO), which yields an approximate formula from the most important features. Our results demonstrate an overall improvement on all analyzed datasets compared with the baseline MatMiner featurizer. In addition, it surpasses the performance of the individual GNN models in several cases, indicating that the combination of traditional and latent-space features leads to more robust generalization.
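The SISSO-style step mentioned in the abstract (screen operator-generated candidate descriptors, then fit a sparse formula) can be sketched in a few lines. This is a toy numpy illustration, not the MatterVial or SISSO implementation; the feature names, operators, and data are invented:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dataset: two primary features; the true target depends on their product.
x1, x2 = rng.normal(size=(2, 200))
y = 3.0 * x1 * x2 + 0.1 * rng.normal(size=200)

# Step 1: expand the feature space with simple operators (as SISSO does).
candidates = {
    "x1": x1,
    "x2": x2,
    "x1+x2": x1 + x2,
    "x1*x2": x1 * x2,
    "x1^2": x1**2,
    "x2^2": x2**2,
}

# Step 2: sure independence screening -- rank candidates by |correlation| with y.
scores = {name: abs(np.corrcoef(f, y)[0, 1]) for name, f in candidates.items()}
best = max(scores, key=scores.get)

# Step 3: sparsifying fit -- least squares on the screened descriptor,
# giving an approximate closed-form formula y ~ coef * (x1*x2) + const.
f = candidates[best]
coef, const = np.linalg.lstsq(np.c_[f, np.ones_like(f)], y, rcond=None)[0]
```

The screening step recovers `x1*x2` as the dominant descriptor and the fit recovers the coefficient near 3, which is the sense in which such methods return an interpretable approximate formula.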
Stochastic Process Learning via Operator Flow Matching
Shi, Yaozhong, Ross, Zachary E., Asimaki, Domniki, Azizzadenesheli, Kamyar
Expanding on neural operators, we propose a novel framework for stochastic process learning across arbitrary domains. In particular, we develop operator flow matching (OFM) for learning stochastic process priors on function spaces. OFM provides the probability density of the values of any collection of points and enables mathematically tractable functional regression at new points with mean and density estimation. Our method outperforms state-of-the-art models in stochastic process learning, functional regression, and prior learning.
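The flow-matching objective underlying OFM is easy to state concretely: sample an interpolation time, interpolate between a base function sample and a data function sample, and regress a velocity field onto a fixed target. A minimal numpy sketch of how the training targets are formed (illustrative only; OFM's actual operator-valued, function-space construction is richer than this pointwise version, and the data here is synthetic):

```python
import numpy as np

rng = np.random.default_rng(1)

# Function samples on a fixed grid: x0 drawn from a simple base (noise)
# process, x1 a sample from the "data" process we want a prior over.
n_points = 64
x0 = rng.normal(size=n_points)                    # base sample
x1 = np.sin(np.linspace(0, 2 * np.pi, n_points))  # target function sample

# Flow-matching training pair: interpolate at a random time t in [0, 1].
# For the straight-line interpolation path, the regression target for the
# velocity field is simply x1 - x0, independent of t.
t = rng.uniform()
x_t = (1.0 - t) * x0 + t * x1
v_target = x1 - x0

# A model v_theta(x_t, t) would be trained with this MSE loss; here we
# evaluate it for a trivial predictor that always outputs zeros.
v_pred = np.zeros(n_points)
loss = np.mean((v_pred - v_target) ** 2)
```

At sampling time, integrating the learned velocity field from t=0 to t=1 transports base-process draws into draws from the learned prior.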
CLSA-CIM: A Cross-Layer Scheduling Approach for Computing-in-Memory Architectures
Pelke, Rebecca, Cubero-Cascante, Jose, Bosbach, Nils, Staudigl, Felix, Leupers, Rainer, Joseph, Jan Moritz
The demand for efficient machine learning (ML) accelerators is growing rapidly, driving the development of novel computing concepts such as resistive random access memory (RRAM)-based tiled computing-in-memory (CIM) architectures. CIM allows computation within the memory unit, resulting in faster data processing and reduced power consumption. Efficient compiler algorithms are essential to exploit the potential of tiled CIM architectures. While conventional ML compilers focus on code generation for CPUs, GPUs, and other von Neumann architectures, adaptations are needed to cover CIM architectures. Cross-layer scheduling is a promising approach, as it enhances the utilization of CIM cores, thereby accelerating computations. Although similar concepts are implicitly used in previous work, there is a lack of clear and quantifiable algorithmic definitions of cross-layer scheduling for tiled CIM architectures. To close this gap, we present CLSA-CIM, a cross-layer scheduling algorithm for tiled CIM architectures. We integrate CLSA-CIM with existing weight-mapping strategies and compare its performance against state-of-the-art (SOTA) scheduling algorithms. CLSA-CIM improves utilization by up to 17.9x, resulting in an overall speedup of up to 29.2x compared to SOTA.
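The utilization argument behind cross-layer scheduling can be made concrete with a toy two-tile model. This is not CLSA-CIM itself, just an invented back-of-the-envelope comparison of layer-by-layer versus pipelined (cross-layer) execution:

```python
# Toy model: two layers, each mapped to its own CIM tile, processing a
# stream of 8 input tiles; each layer takes 1 time unit per input.
n_inputs = 8
t_layer = 1

# Layer-by-layer scheduling: finish all of layer 1, then all of layer 2,
# so one of the two tiles is always idle.
makespan_seq = 2 * n_inputs * t_layer

# Cross-layer (pipelined) scheduling: layer 2 starts on an input as soon
# as layer 1 emits it, so after a 1-step fill both tiles work in parallel.
makespan_pipe = (n_inputs + 1) * t_layer

busy_time = 2 * n_inputs * t_layer  # total useful work is identical
util_seq = busy_time / (2 * makespan_seq)    # 2 tiles available per step
util_pipe = busy_time / (2 * makespan_pipe)
```

Here utilization rises from 0.5 to about 0.89, and the makespan shrinks accordingly; the same mechanism, generalized to many tiles and layers, is what a cross-layer scheduler exploits.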
Energy Consumption of Neural Networks on NVIDIA Edge Boards: an Empirical Model
Lahmer, Seyyidahmed, Khoshsirat, Aria, Rossi, Michele, Zanella, Andrea
Recently, there has been a trend of shifting the execution of deep learning inference tasks toward the edge of the network, closer to the user, to reduce latency and preserve data privacy. At the same time, growing interest is being devoted to the energetic sustainability of machine learning. At the intersection of these trends, we find the energetic characterization of machine learning at the edge, which is attracting increasing attention. Unfortunately, calculating the energy consumption of a given neural network during inference is complicated by the heterogeneity of the possible underlying hardware implementations. In this work, we therefore aim to profile the energy consumption of inference tasks on some modern edge nodes and to derive simple but realistic models. To this end, we performed a large number of experiments to collect the energy consumption of convolutional and fully connected layers on two well-known NVIDIA edge boards, namely the Jetson TX2 and Xavier. From the measurements, we then distilled a simple, practical model that can estimate the energy consumption of a given inference task on the considered boards. We believe that this model can be used in many contexts: for instance, to guide the search for efficient architectures in neural architecture search, as a heuristic in neural network pruning, to find energy-efficient offloading strategies in a split computing context, or simply to evaluate the energetic performance of deep neural network architectures.
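A "simple but realistic" per-layer energy model of the kind described is typically an affine fit of measured energy against a workload proxy such as multiply-accumulate count. The sketch below is a hypothetical illustration with made-up coefficients and synthetic "measurements", not the paper's fitted model or data:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical measurements: per-layer energy (mJ) vs. multiply-accumulate
# count (MACs). The slope, offset, and noise level are invented.
macs = rng.uniform(1e6, 1e9, size=50)
energy = 2.5e-9 * macs + 0.8 + 0.05 * rng.normal(size=50)

# Least-squares fit of the empirical model: energy ~ a * MACs + b,
# where b captures workload-independent (idle/overhead) energy.
A = np.c_[macs, np.ones_like(macs)]
(a, b), *_ = np.linalg.lstsq(A, energy, rcond=None)

def predict_energy(n_macs: float) -> float:
    """Estimate inference energy (mJ) for a layer with n_macs MACs."""
    return a * n_macs + b
```

Such a model is cheap to evaluate, which is what makes it usable inside inner loops like neural architecture search or pruning.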
- Information Technology > Hardware (0.73)
- Information Technology > Security & Privacy (0.54)
- Energy > Renewable (0.47)
Don't miss the Mismatch: Investigating the Objective Function Mismatch for Unsupervised Representation Learning
Stuhr, Bonifaz, Brauer, Jürgen
Finding general evaluation metrics for unsupervised representation learning techniques is a challenging open research question, which has recently become increasingly necessary due to the growing interest in unsupervised methods. Even though these methods promise beneficial representation characteristics, most approaches currently suffer from the objective function mismatch. This mismatch means that the performance on a desired target task can decrease when the unsupervised pretext task is learned for too long - especially when both tasks are ill-posed. In this work, we build upon the widely used linear evaluation protocol and define new general evaluation metrics to quantitatively capture the objective function mismatch and the more generic metrics mismatch. We discuss the usability and stability of our protocols on a variety of pretext and target tasks and study mismatches in a wide range of experiments. In doing so, we reveal dependencies of the objective function mismatch across several pretext and target tasks with respect to the pretext model's representation size, target model complexity, pretext and target augmentations, as well as pretext and target task types.
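The linear evaluation protocol the abstract builds on is simple to state: freeze the pretext model's representations and train only a linear classifier on top, so target accuracy reflects representation quality alone. A minimal numpy sketch with synthetic stand-in features (the data, dimensions, and least-squares probe are illustrative choices, not the paper's setup):

```python
import numpy as np

rng = np.random.default_rng(3)

# Synthetic stand-in for frozen pretext-model representations: two classes
# separated along one latent direction, plus distractor dimensions.
n, d = 400, 16
labels = rng.integers(0, 2, size=n)
feats = rng.normal(size=(n, d))
feats[:, 0] += 3.0 * labels  # class signal lives in the first coordinate

# Linear evaluation: the representation stays frozen; only a linear probe
# (here a least-squares classifier on +/-1 targets) is trained on top.
X = np.c_[feats, np.ones(n)]  # append bias column
w, *_ = np.linalg.lstsq(X, 2.0 * labels - 1.0, rcond=None)
preds = (X @ w > 0).astype(int)
accuracy = (preds == labels).mean()
```

Tracking this probe accuracy over pretext-training time is exactly where an objective function mismatch shows up: pretext loss keeps improving while the frozen-probe target accuracy peaks and then degrades.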