AITopics | mse 0

Collaborating Authors

mse 0

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

This Time is Different An Perspective on Time Series Foundation Models

Neural Information Processing SystemsJun-17-2026, 00:56:59 GMT

We introduce TOTO, a time series forecasting foundation model with 151 million parameters. TOTO uses a modern decoder-only architecture coupled with architectural innovations designed to account for specific challenges found in multivariate observability time series data. TOTO's pre-training corpus is a mixture of observability data, open datasets, and synthetic data, and is 4-10 larger than those of leading time series foundation models. Additionally, we introduce BOOM, a large-scale benchmark consisting of 350 million observations across 2,807 real-world time series. For both TOTO and BOOM, we source observability data exclusively from Datadog's own telemetry and internal observability metrics. Extensive evaluations demonstrate that TOTO achieves state-of-the-art performance on both BOOM and on established general purpose time series forecasting benchmarks.

data mining, machine learning, urlhttp, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.47)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Data Science > Data Mining (0.86)

Add feedback

Scalable Decision-Focused Learning through Cost-Sensitive Regression

Schutte, Noah, Berden, Senne, Guns, Tias, Postek, Krzysztof, Yorke-Smith, Neil

arXiv.org Machine LearningMay-19-2026

Many real-world combinatorial problems involve uncertain parameters, which can be predicted given contextual features and historical data. These `predict-then-optimize' or `contextual optimization' problems have gained significant attention: end-to-end training methods can now minimize the downstream task cost rather than the predictive error. However, despite their effectiveness, these decision-focused learning (DFL) approaches often rely on repeated solving of the underlying combinatorial optimization problem during training, making them computationally expensive and difficult to scale. We reframe the learning problem as a cost-sensitive multi-output regression problem: multi-output due to the combinatorial problem having multiple uncertain parameters, and cost-sensitive due to the downstream task cost being the real target. Our technical contribution is the formalization of multiple loss function components that follow from this reframing: cost-insensitive normalization, decision-aware asymmetric penalization of over- and underpredictions, and instance-based costs that mimic the true downstream task-based loss locally. These components require zero or one solve per training data instance, while requiring no further solves during training. Experiments show that the combination of loss components achieves comparable downstream task quality to the state of the art, while being significantly more efficient, enabling scaling to problem sizes that have not been tackled before with DFL.

artificial intelligence, instance-based cost, machine learning, (19 more...)

arXiv.org Machine Learning

2605.18005

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Appendix

Neural Information Processing SystemsApr-25-2026, 03:48:18 GMT

In this appendix, we first introduce the datasets and evaluation metrics used in the experiments in Section A. Then, we provide extra experimental results in Section B. In Section C, we present details of network design, training scheme, and hyper-parameter tuning. We conduct experiments on 11 popular time series datasets: (1) Electricity Transformer Temperature [42] (ETTh(1,2),ETTm1) 3consists of 2 year electric power data collected from two separated counties of China. Each data point includes an "oil temperature" value and 6 power load features. The data is aggregated into 5-minutes windows, resulting in 12 points per hour and 288 points per day. A.1 Electricity Transformer Temperature (ETT) For data pre-processing, we perform zero-mean normalization, i.e., X We use Mean Absolute Errors (MAE) [17] and Mean Squared Errors (MSE) [26] for model comparison.

artificial intelligence, dataset, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.29)

Industry:

Energy > Power Industry (1.00)
Energy > Renewable > Solar (0.33)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

37c6d0bc4d2917dcbea693b18504bd87-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 04:23:16 GMT

data mining, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.04)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Government (0.67)
Banking & Finance (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.68)
(3 more...)

Add feedback

266983d0949aed78a16fa4782237dea7-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 23:33:34 GMT

forecasting, scinet, time sery, (15 more...)

Neural Information Processing Systems

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Modeling & Simulation (0.94)
Information Technology > Artificial Intelligence > Natural Language (0.69)

Add feedback

0898f05f6c1d247be3eab8da93d33da1-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 14:24:18 GMT

dataset, experiment, forecasting, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science > Data Mining (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

This Time is Different: An Observability Perspective on Time Series Foundation Models

Cohen, Ben, Khwaja, Emaad, Doubli, Youssef, Lemaachi, Salahidine, Lettieri, Chris, Masson, Charles, Miccinilli, Hugo, Ramé, Elise, Ren, Qiqi, Rostamizadeh, Afshin, Terrail, Jean Ogier du, Toon, Anna-Monica, Wang, Kan, Xie, Stephan, Xu, Zongzhe, Zhukova, Viktoriya, Asker, David, Talwalkar, Ameet, Abou-Amal, Othmane

arXiv.org Artificial IntelligenceNov-6-2025

We introduce Toto, a time series forecasting foundation model with 151 million parameters. Toto uses a modern decoder-only architecture coupled with architectural innovations designed to account for specific challenges found in multivariate observability time series data. Toto's pre-training corpus is a mixture of observability data, open datasets, and synthetic data, and is 4-10$\times$ larger than those of leading time series foundation models. Additionally, we introduce BOOM, a large-scale benchmark consisting of 350 million observations across 2,807 real-world time series. For both Toto and BOOM, we source observability data exclusively from Datadog's own telemetry and internal observability metrics. Extensive evaluations demonstrate that Toto achieves state-of-the-art performance on both BOOM and on established general purpose time series forecasting benchmarks. Toto's model weights, inference code, and evaluation scripts, as well as BOOM's data and evaluation code, are all available as open source under the Apache 2.0 License available at https://huggingface.co/Datadog/Toto-Open-Base-1.0 and https://github.com/DataDog/toto.

large language model, machine learning, mae 0, (21 more...)

arXiv.org Artificial Intelligence

2505.14766

Country: North America (0.93)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Modeling & Simulation (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
(4 more...)

Add feedback

xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion

Li, Quan, Yu, Wenchao, Wang, Suhang, Lin, Minhua, Chen, Lingwei, Cheng, Wei, Chen, Haifeng

arXiv.org Artificial IntelligenceOct-24-2025

Abstract--Extreme events frequently occur in real-world time series and often carry significant practical implications. In domains such as climate and healthcare, these events, such as floods, heatwaves, or acute medical episodes, can lead to serious consequences. Accurate forecasting of such events is therefore of substantial importance. Most existing time series forecasting models are optimized for overall performance within the prediction window, but often struggle to accurately predict extreme events, such as high temperatures or heart rate spikes. The main challenges are data imbalance and the neglect of valuable information contained in intermediate events that precede extreme events. In this paper, we propose xTime, a novel framework for extreme event forecasting in time series. In addition, we introduce a mixture of experts (MoE) mechanism that dynamically selects and fuses outputs from expert models across different rarity levels, which further improves the forecasting performance for extreme events. Experiments on multiple datasets show that xTime achieves consistent improvements, with forecasting accuracy on extreme events improving from 3% to 78%. Time series forecasting plays a fundamental role across a broad spectrum of critical applications, such as stock market analysis, weather and climate modeling, and electricity demand prediction.

data mining, expert model, machine learning, (22 more...)

arXiv.org Artificial Intelligence

2510.20651

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.48)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

SynTSBench: Rethinking Temporal Pattern Learning in Deep Learning Models for Time Series

Tan, Qitai, Chen, Yiyun, Li, Mo, Gu, Ruiwen, Su, Yilin, Zhang, Xiao-Ping

arXiv.org Artificial IntelligenceOct-24-2025

Recent advances in deep learning have driven rapid progress in time series forecasting, yet many state-of-the-art models continue to struggle with robust performance in real-world applications, even when they achieve strong results on standard benchmark datasets. This persistent gap can be attributed to the black-box nature of deep learning architectures and the inherent limitations of current evaluation frameworks, which frequently lack the capacity to provide clear, quantitative insights into the specific strengths and weaknesses of different models, thereby complicating the selection of appropriate models for particular forecasting scenarios. To address these issues, we propose a synthetic data-driven evaluation paradigm, SynTSBench, that systematically assesses fundamental modeling capabilities of time series forecasting models through programmable feature configuration. Our framework isolates confounding factors and establishes an interpretable evaluation system with three core analytical dimensions: (1) temporal feature decomposition and capability mapping, which enables systematic evaluation of model capacities to learn specific pattern types; (2) robustness analysis under data irregularities, which quantifies noise tolerance thresholds and anomaly recovery capabilities; and (3) theoretical optimum benchmark-ing, which establishes performance boundaries for each pattern type--enabling direct comparison between model predictions and mathematical optima. Our experiments show that current deep learning models do not universally approach optimal baselines across all types of temporal features.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2510.20273

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.92)

Industry:

Banking & Finance > Economy (0.67)
Energy (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback