
 reformer


Fast Transformers with Clustered Attention Supplementary Material

Neural Information Processing Systems

Figure 1: Flow chart demonstrating the computation for clustered attention. For more details refer to Section 1.1 of this supplementary or Section 3.2 of the main paper. Work done at Idiap. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. We then present the flow chart demonstrating the same. This is followed by taking the weighted average of the 3 corresponding values.


We are encouraged that reviewers find our paper clear and well written (R1, R2, R3) and our method to be theoretically sound

Neural Information Processing Systems

We would like to thank the reviewers for their helpful comments and their thorough evaluation of our work. Reversible layers are a technique introduced by Gomez et al. (2017) and are orthogonal to our approach. In contrast, clustered attention places no such restriction. We will also add Set Transformers to the related work section. Is speech favorable to clustering? We would like to mention that our NLP approximation experiment for the GLUE and SQuAD tasks in Section 4.3 speaks to NLP/vision tasks in the long-context setting, as suggested.




47d40767c7e9df50249ebfd9c7cfff77-AuthorFeedback.pdf

Neural Information Processing Systems

We thank the reviewers for their valuable comments! "Unclear if the proposed method is better than only using LSH." Thank you for the suggestions. ALSH significantly outperforms both E2LSH and the Reformer LSH scheme on SMYRF-BERT base (see also Table 2).


Mamba Outpaces Reformer in Stock Prediction with Sentiments from Top Ten LLMs

Kadiyala, Lokesh Antony, Mirzaeinia, Amir

arXiv.org Artificial Intelligence

The stock market is extremely difficult to predict in the short term due to high market volatility, changes caused by news, and the non-linear nature of financial time series. This research proposes a novel framework for improving minute-level prediction accuracy using semantic sentiment scores from ten different top large language models (LLMs) combined with minute-interval intraday stock price data. We systematically constructed a time-aligned dataset of AAPL news articles and 1-minute Apple Inc. (AAPL) stock prices for the dates of April 4 to May 2, 2025. Sentiment analysis was performed with the DeepSeek-V3, GPT variants, LLaMA, Claude, Gemini, Qwen, and Mistral models through their APIs. Each article received sentiment scores from all ten LLMs, which were scaled to a [0, 1] range and combined with prices and technical indicators such as RSI, ROC, and Bollinger Band Width. Two state-of-the-art sequence models, Reformer and Mamba, were trained separately on the dataset using the sentiment scores produced by each LLM as input. Hyperparameters were optimized with Optuna, and models were evaluated over a 3-day evaluation period. Mean squared error (MSE) was the evaluation metric, and it should be noted that Mamba was not only faster but also more accurate than Reformer for every one of the ten LLMs tested. Mamba performed best with LLaMA 3.3--70B, with the lowest error of 0.137. While Reformer could capture broader trends within the data, the model appeared to over-smooth the sudden changes signaled by the LLMs. This study highlights the potential of integrating LLM-based semantic analysis with efficient temporal modeling to enhance real-time financial forecasting.
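The feature pipeline described in the abstract (min-max scaling of LLM sentiment scores to [0, 1], plus RSI, ROC, and Bollinger Band Width on the price series) can be sketched as below. The abstract does not specify the indicator periods or the exact scaling method, so the period defaults here follow common conventions and should be read as assumptions, not as the authors' implementation.

```python
import numpy as np

def rsi(prices, period=14):
    """Relative Strength Index over the last `period` price changes."""
    deltas = np.diff(prices)
    gains = np.clip(deltas[-period:], 0, None)
    losses = np.clip(-deltas[-period:], 0, None)
    avg_gain, avg_loss = gains.mean(), losses.mean()
    if avg_loss == 0:          # no down-moves in the window
        return 100.0
    rs = avg_gain / avg_loss
    return 100.0 - 100.0 / (1.0 + rs)

def roc(prices, period=12):
    """Rate of Change: percent change over `period` steps."""
    return 100.0 * (prices[-1] - prices[-1 - period]) / prices[-1 - period]

def bollinger_band_width(prices, period=20, k=2.0):
    """Distance between upper and lower bands, relative to the moving average."""
    window = np.asarray(prices[-period:], dtype=float)
    ma, sd = window.mean(), window.std()
    return 2.0 * k * sd / ma

def scale_sentiment(scores):
    """Min-max scale raw LLM sentiment scores to the [0, 1] range."""
    scores = np.asarray(scores, dtype=float)
    lo, hi = scores.min(), scores.max()
    if hi == lo:               # constant scores: map everything to 0
        return np.zeros_like(scores)
    return (scores - lo) / (hi - lo)

# Example: build one feature row from a price window and per-LLM sentiment.
prices = np.linspace(190.0, 195.0, 30)              # hypothetical 1-minute closes
sentiments = [-0.8, -0.1, 0.0, 0.3, 0.5, 0.9]       # hypothetical raw LLM scores
features = np.concatenate([
    scale_sentiment(sentiments),
    [rsi(prices), roc(prices), bollinger_band_width(prices)],
])
```

Each minute's feature row would then be fed to the sequence model; a strictly rising price window like the one above drives RSI to its ceiling of 100, which is why momentum indicators are usually paired with volatility measures such as the band width.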





