
b9523d484af624986c2e0c630ac44ecb-Supplemental-Conference.pdf

Neural Information Processing Systems

Lemma B.4. (Lemma 2.1.8 in [4]) For any diffeomorphism f ∈ Diff^k_c(ℝ^d) and any δ > 0, there exists a finite sequence of (δ, k)-near-identity diffeomorphisms g_1, …, g_s such that f = g_s ∘ g_{s−1} ∘ ⋯ ∘ g_1. Let π_i : ℝ^d → ℝ denote the projection onto the i-th coordinate. Suppose f : ℝ^d → ℝ^d is compactly supported and sufficiently C^k-close to the identity. In this section, we analyze how to make the affine coupling flow with dimension augmentation invertible. To handle this problem, we need to make sure that Range(F) is tractable for easy sampling.
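The invertibility discussed above rests on the structure of a standard affine coupling layer: the first block of coordinates passes through unchanged and parameterizes an elementwise affine map of the remaining coordinates, so the inverse is available in closed form. Below is a minimal NumPy sketch of such a layer; the `scale_net` and `shift_net` names are generic placeholders (any functions of the pass-through block), not the specific networks from the supplemental material.

```python
import numpy as np

def affine_coupling_forward(x, d_split, scale_net, shift_net):
    """Forward pass of a standard affine coupling layer.

    The first d_split coordinates pass through unchanged and
    parameterize an elementwise affine map of the remainder.
    """
    x1, x2 = x[:d_split], x[d_split:]
    s, t = scale_net(x1), shift_net(x1)
    y2 = x2 * np.exp(s) + t  # elementwise affine, hence trivially invertible
    return np.concatenate([x1, y2])

def affine_coupling_inverse(y, d_split, scale_net, shift_net):
    """Exact inverse: x1 = y1 is known, so s and t can be recomputed."""
    y1, y2 = y[:d_split], y[d_split:]
    s, t = scale_net(y1), shift_net(y1)
    x2 = (y2 - t) * np.exp(-s)
    return np.concatenate([y1, x2])
```

Because the scale factor exp(s) is strictly positive, the layer is a bijection for any choice of the two networks; the delicate part addressed in the section is what happens when the input is first augmented with extra dimensions, since then Range(F) is a strict subset of the augmented space.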







Do Neural Scaling Laws Exist on Graph Self-Supervised Learning?

Ma, Qian, Mao, Haitao, Liu, Jingzhe, Zhang, Zhehua, Feng, Chunlin, Song, Yu, Shao, Yihan, Ma, Yao

arXiv.org Artificial Intelligence

Self-supervised learning~(SSL) is essential to obtain foundation models in NLP and CV domains via effectively leveraging knowledge in large-scale unlabeled data. The reason for its success is that a suitable SSL design can help the model to follow the neural scaling law, i.e., the performance consistently improves with increasing model and dataset sizes. However, it remains unclear whether existing SSL in the graph domain can follow the scaling behavior toward building Graph Foundation Models~(GFMs) with large-scale pre-training. In this study, we examine whether existing graph SSL techniques can follow the neural scaling behavior with the potential to serve as the essential component for GFMs. Our benchmark includes comprehensive SSL technique implementations with analysis conducted on both the conventional SSL setting and many new settings adopted in other domains. Surprisingly, despite the SSL loss continuously decreasing, no existing graph SSL techniques follow the neural scaling behavior on the downstream performance. The model performance merely fluctuates across different data scales and model scales. Instead of scale, the key factors influencing performance are the choice of model architecture and the pretext task design. This paper examines the feasibility of existing graph SSL techniques for developing GFMs and opens a new direction for graph SSL design with a new evaluation prototype. Our code implementation is available online to ease reproducibility at https://github.com/GraphSSLScaling/GraphSSLScaling.


Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs

Yuan, Kai, Bauinger, Christoph, Zhang, Xiangyi, Baehr, Pascal, Kirchhart, Matthias, Dabert, Darius, Tousnakhoff, Adrien, Boudier, Pierre, Paulitsch, Michael

arXiv.org Artificial Intelligence

This paper presents a SYCL implementation of Multi-Layer Perceptrons (MLPs), which targets and is optimized for the Intel Data Center GPU Max 1550. To increase the performance, our implementation minimizes the slow global memory accesses by maximizing the data reuse within the general register file and the shared local memory by fusing the operations in each layer of the MLP. We show with a simple roofline model that this results in a significant increase in the arithmetic intensity, leading to improved performance, especially for inference. We compare our approach to a similar CUDA implementation for MLPs and show that our implementation on the Intel Data Center GPU outperforms the CUDA implementation on Nvidia's H100 GPU by a factor of up to 2.84 in inference and 1.75 in training. The paper also showcases the efficiency of our SYCL implementation in three significant areas: Image Compression, Neural Radiance Fields, and Physics-Informed Machine Learning. In all cases, our implementation outperforms the off-the-shelf Intel Extension for PyTorch (IPEX) implementation on the same Intel GPU by up to a factor of 30 and the CUDA PyTorch version on Nvidia's H100 GPU by up to a factor of 19. The code can be found at https://github.com/intel/tiny-dpcpp-nn.


Efficient Bayesian Learning Curve Extrapolation using Prior-Data Fitted Networks

Adriaensen, Steven, Rakotoarison, Herilalaina, Müller, Samuel, Hutter, Frank

arXiv.org Machine Learning

Learning curve extrapolation aims to predict model performance in later epochs of training, based on the performance in earlier epochs. In this work, we argue that, while the inherent uncertainty in the extrapolation of learning curves warrants a Bayesian approach, existing methods are (i) overly restrictive, and/or (ii) computationally expensive. We describe the first application of prior-data fitted neural networks (PFNs) in this context. A PFN is a transformer, pre-trained on data generated from a prior, to perform approximate Bayesian inference in a single forward pass. We propose LC-PFN, a PFN trained to extrapolate 10 million artificial right-censored learning curves generated from a parametric prior proposed in prior art using MCMC. We demonstrate that LC-PFN can approximate the posterior predictive distribution more accurately than MCMC, while being over 10 000 times faster. We also show that the same LC-PFN achieves competitive performance extrapolating a total of 20 000 real learning curves from four learning curve benchmarks (LCBench, NAS-Bench-201, Taskset, and PD1) that stem from training a wide range of model architectures (MLPs, CNNs, RNNs, and Transformers) on 53 different datasets with varying input modalities (tabular, image, text, and protein data). Finally, we investigate its potential in the context of model selection and find that a simple LC-PFN based predictive early stopping criterion obtains 2 - 6x speed-ups on 45 of these datasets, at virtually no overhead.
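The training recipe described above hinges on cheap synthetic data: curves are sampled from a parametric prior, right-censored at a random cutoff, and the PFN learns to predict the censored suffix from the observed prefix. The sketch below illustrates that data-generation pattern with a toy power-law family y(t) = c − a·t^(−α); the actual prior used for LC-PFN is considerably richer, so treat the functional form and parameter ranges here as assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_curve(n_epochs=50):
    """Draw one synthetic learning curve from a toy power-law prior
    y(t) = c - a * t**(-alpha) plus observation noise (a stand-in for
    the richer parametric prior used to pre-train LC-PFN)."""
    c = rng.uniform(0.5, 1.0)      # asymptotic performance
    a = rng.uniform(0.1, c)        # initial gap to the asymptote
    alpha = rng.uniform(0.3, 2.0)  # convergence rate
    t = np.arange(1, n_epochs + 1)
    y = c - a * t ** (-alpha) + rng.normal(0.0, 0.01, n_epochs)
    return np.clip(y, 0.0, 1.0)

def make_training_example(n_epochs=50):
    """Right-censor a curve at a random cutoff: the prefix is the PFN's
    conditioning context, the censored suffix is the prediction target."""
    y = sample_curve(n_epochs)
    cutoff = int(rng.integers(5, n_epochs))
    return y[:cutoff], y[cutoff:]
```

Generating millions of such (prefix, suffix) pairs is embarrassingly parallel and requires no real training runs, which is what makes pre-training on 10 million curves feasible in the first place.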