AITopics | channel

Collaborating Authors

channel

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ee6c4b99b4c0d3d60efd22c1ecdd9891-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 15:14:15 GMT

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(4 more...)

Genre:

Research Report > Experimental Study (0.93)
Research Report > Promising Solution (0.67)

Industry:

Health & Medicine (0.68)
Information Technology (0.67)
Education (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Algorithm1: Haarwavelettransformationpseudocode,PyTorch-like

Neural Information Processing SystemsFeb-12-2026, 05:09:07 GMT

D, demonstrating that our FreGAN is frequency-aware and can indeed produce realisticfrequencysignals. Broaderimpact. For HFD, we aggregate the high-frequency components by addingLH,HL,HH and then employ additional downsampling and convolutional layers tocompute the output scores. They are ideal for verifying the quality of the generation in low-shot scenarios. BrecaHAD9 dataset contains 162 images for breast cancer histopathological annotation and diagnosis. We evaluate the performance of our FreGAN and baseline models on more datasets with limited data amounts in Tab.1, namely, Medici, Temple, Bridge, and Wuzhen, all of which contain only 100 training images.

artificial intelligence, fregan, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Industry: Health & Medicine (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

c47e6286162ec5442e06fe2b7cb7145f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 19:47:34 GMT

For each convolutional layer, we would firstly apply a ReLU activation function right after the convolution,andthenapplyamaxpoolingwith kernel_size=2,stride=2toextractthefeature map.

artificial intelligence, machine learning, wtk, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

93661c10ed346f9692f4d512319799b3-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 23:29:05 GMT

On the training distribution animal and background feature are equally predictive of the label: It=1(y;a)=It=1(y;b)=It=1(y;ab).

artificial intelligence, dkl, machine learning, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

4a36c3c51af11ed9f34615b81edb5bbc-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 18:27:01 GMT

The left panelshows the energy profile for arotation around an O-C-C-C dihedral angle. In the right panel of Figure 4, we show energy predictions along a minimum energy path of an intramolecular hydrogen transfer reaction. A.2.2 3BPADataset The 3BPA dataset contains DFT train test splits of a flexible drug-like organic molecule sampled from different temperature molecular dynamics trajectories [33]. The first step of the algorithm is to contract the generalized Clebsch-Gordan coefficients with the weights of the product basis. Then, the last dimension of cν is contracted with theAi-features' last dimension resulting in the a-tensor with correlation orderν 1.

artificial intelligence, channel, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.93)

Add feedback

A Method

Neural Information Processing SystemsFeb-7-2026, 18:14:37 GMT

As computing the inverse second-order derivatives is the most computation-intensive operation, we will focus on it. In Section 3.1, we use the trick of least square to compute the We can leverage the Neumann series to compute the matrix inverse. B.1 Proof of the Approximation by Implicit Gradients Here, we provide the proof for J. B.2 Proof of Theorem 3.1 Before we prove our main theorem, we prove several essential lemmas as below. Using Assumption 3.4 and 3.5 directly lead to r By Assumption 3.4, we have r By Lemma B.1 and Lemma B.2, we have r If Assumption 3.4 and 3.5 hold, then the The linear model we use is a matrix that maps the input data into a vector. LeNet model is a convolutional neural network with 4 convolutional layers and 1 fully connected layer.

artificial intelligence, machine learning, privacy risk, (19 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Li, Xiaoya, Sun, Xiaofei, Wang, Albert, Li, Jiwei, Shum, Chris

arXiv.org Artificial IntelligenceNov-27-2025

The exponential growth in demand for GPU computing resources has created an urgent need for automated CUDA optimization strategies. While recent advances in LLMs show promise for code generation, current SOTA models achieve low success rates in improving CUDA speed. In this paper, we introduce CUDA-L1, an automated reinforcement learning framework for CUDA optimization that employs a novel contrastive RL algorithm. CUDA-L1 achieves significant performance improvements on the CUDA optimization task: trained on A100, it delivers an average speedup of x3.12 with a median speedup of x1.42 against default baselines over across all 250 CUDA kernels of KernelBench, with peak speedups reaching x120. In addition to the default baseline provided by KernelBench, CUDA-L1 demonstrates x2.77 over Torch Compile, x2.88 over Torch Compile with reduce overhead, x2.81 over CUDA Graph implementations, and remarkably x7.72 over cuDNN libraries. Furthermore, the model also demonstrates portability across different GPU architectures. Beyond these benchmark results, CUDA-L1 demonstrates several properties: it 1) discovers a variety of CUDA optimization techniques and learns to combine them strategically to achieve optimal performance; 2) uncovers fundamental principles of CUDA optimization, such as the multiplicative nature of optimizations; 3) identifies non-obvious performance bottlenecks and rejects seemingly beneficial optimizations that actually harm performance. The capabilities demonstrate that, RL can transform an initially poor-performing LLM into an effective CUDA optimizer through speedup-based reward signals alone, without human expertise or domain knowledge. This paradigm opens possibilities for automated optimization of CUDA operations, and holds promise to substantially promote GPU efficiency and alleviate the rising pressure on GPU computing resources.

large language model, machine learning, reinforcement learning, (20 more...)

arXiv.org Artificial Intelligence

2507.14111

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SDformer: Similarity-driven Discrete Transformer For Time Series Generation

Neural Information Processing SystemsOct-10-2025, 20:51:44 GMT

Comprehensive experiments show that our method significantly outperforms competing approaches in terms of the generated time series quality while also ensuring a short inference time.

dataset, sdformer, time sery, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(4 more...)

Genre:

Research Report > Experimental Study (0.93)
Research Report > Promising Solution (0.67)

Industry:

Health & Medicine (0.68)
Information Technology (0.67)
Education (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Fidel-TS: A High-Fidelity Benchmark for Multimodal Time Series Forecasting

Xu, Zhijian, Cai, Wanxu, Dai, Xilin, Deng, Zhaorong, Xu, Qiang

arXiv.org Machine LearningSep-30-2025

The evaluation of time series forecasting models is hindered by a critical lack of high-quality benchmarks, leading to a potential illusion of progress. Existing datasets suffer from issues ranging from pre-training data contamination in the age of LLMs to the causal and description leakage prevalent in early multimodal designs. To address this, we formalize the core principles of high-fidelity benchmarking, focusing on data sourcing integrity, strict causal soundness, and structural clarity. We introduce Fidel-TS, a new large-scale benchmark built from the ground up on these principles by sourcing data from live APIs. Our extensive experiments validate this approach by exposing the critical biases and design limitations of prior benchmarks. Furthermore, we conclusively demonstrate that the causal relevance of textual information is the key factor in unlocking genuine performance gains in multimodal forecasting.

benchmark, dataset, forecasting, (16 more...)

arXiv.org Machine Learning

2509.24789

Country: