false alarm
A Benchmark of Causal vs Correlation AI for Predictive Maintenance
Taduri, Krishna, Dhande, Shaunak, Paolo, Giacinto, Saggese, null, Smith, Paul
Predictive maintenance in manufacturing environments presents a challenging optimization problem characterized by extreme cost asymmetry, where missed failures incur costs roughly fifty times higher than false alarms. Conventional machine learning approaches typically optimize statistical accuracy metrics that do not reflect this operational reality and cannot reliably distinguish causal relationships from spurious correlations. This study evaluates eight predictive models, ranging from baseline statistical approaches to formal causal inference methods, on a dataset of 10,000 CNC machines with a 3.3 percent failure prevalence. The formal causal inference model (L5) achieved estimated annual cost savings of 1.16 million USD (a 70.2 percent reduction), outperforming the best correlation-based decision tree model (L3) by approximately 80,000 USD per year. The causal model matched the highest observed recall (87.9 percent) while reducing false alarms by 97 percent (from 165 to 5) and attained a precision of 92.1 percent, with a train-test performance gap of only 2.6 percentage points. These results indicate that causal AI methods, when combined with domain knowledge, can yield superior financial outcomes and more interpretable predictions compared to correlation-based approaches in predictive maintenance applications.
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.72)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)
Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization
Kang, Inha, Kim, Eunki, Ryu, Wonjeong, Shin, Jaeyo, Yu, Seungjun, Kang, Yoon-Hee, Jeong, Seongeun, Kim, Eunhye, Kim, Soontae, Shim, Hyunjung
Accurate long horizon forecasting of particulate matter (PM) concentration fields is essential for operational public health decisions. However, achieving reliable forecasts remains challenging in regions with complex terrain and strong atmospheric dynamics such as East Asia. While foundation models such as Aurora offer global generality, they often miss region-specific dynamics and rely on non-real-time inputs, limiting their practical utility for localized warning systems. T o address this gap, we construct and release the real-world observations and high-resolution CMAQ-OBS dataset for East Asia, reducing regional error by 59.5% and enabling real-time 48-120 hour forecasts critical for public health alerts. However, standard point-wise objectives cannot reflect asymmetric operational costs, where false alarms deteriorate public trust while missed severe events endanger populations. This cost mismatch causes SFT models to over-predict and yield high False Alarm Rates. W e introduce Group-Relative Policy Optimization (GRPO) with class-wise rewards and curriculum rollout to align predictions with operational priorities. Experimental results demonstrate that our framework significantly improves the reliability of the forecast. Compared to the SFT-only baseline, our model reduces the False Alarm Rate by 47.3% while achieving a competitive F1-score, proving its effectiveness for practical, real-world air quality forecasting systems on long lead time scenarios.
- Asia > East Asia (0.46)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > China (0.05)
- (3 more...)
Beyond Ranked Lists: The SARAL Framework for Cross-Lingual Document Set Retrieval
Agarwal, Shantanu, Barry, Joel, Boschee, Elizabeth, Miller, Scott
Machine Translation for English Retrieval of Information in Any Language (MATERIAL) is an IARPA initiative targeted to advance the state of cross-lingual information retrieval (CLIR). This report provides a detailed description of Information Sciences Institute's (ISI's) Summarization and domain-Adaptive Retrieval Across Language's (SARAL's) effort for MATERIAL. Specifically, we outline our team's novel approach to handle CLIR with emphasis in developing an approach amenable to retrieve a query-relevant document \textit{set}, and not just a ranked document-list. In MATERIAL's Phase-3 evaluations, SARAL exceeded the performance of other teams in five out of six evaluation conditions spanning three different languages (Farsi, Kazakh, and Georgian).
- North America > United States > California (0.14)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.05)
- North America > United States > Ohio > Franklin County > Columbus (0.04)
- Africa > East Africa (0.04)
SynCast: Synergizing Contradictions in Precipitation Nowcasting via Diffusion Sequential Preference Optimization
Xu, Kaiyi, Gong, Junchao, Zhang, Wenlong, Fei, Ben, Bai, Lei, Ouyang, Wanli
Precipitation nowcasting based on radar echoes plays a crucial role in monitoring extreme weather and supporting disaster prevention. Although deep learning approaches have achieved significant progress, they still face notable limitations. For example, deterministic models tend to produce over-smoothed predictions, which struggle to capture extreme events and fine-scale precipitation patterns. Probabilistic generative models, due to their inherent randomness, often show fluctuating performance across different metrics and rarely achieve consistently optimal results. Furthermore, precipitation nowcasting is typically evaluated using multiple metrics, some of which are inherently conflicting. For instance, there is often a trade-off between the Critical Success Index (CSI) and the False Alarm Ratio (FAR), making it challenging for existing models to deliver forecasts that perform well on both metrics simultaneously. To address these challenges, we introduce preference optimization into precipitation nowcasting for the first time, motivated by the success of reinforcement learning from human feedback in large language models. Specifically, we propose SynCast, a method that employs the two-stage post-training framework of Diffusion Sequential Preference Optimization (Diffusion-SPO), to progressively align conflicting metrics and consistently achieve superior performance. In the first stage, the framework focuses on reducing FAR, training the model to effectively suppress false alarms. Building on this foundation, the second stage further optimizes CSI with constraints that preserve FAR alignment, thereby achieving synergistic improvements across these conflicting metrics.
Formally Exploring Time-Series Anomaly Detection Evaluation Metrics
Wagner, Dennis, Nair, Arjun, Franks, Billy Joe, Arweiler, Justus, Muraleedharan, Aparna, Jungjohann, Indra, Hartung, Fabian, Ahuja, Mayank C., Balinskyy, Andriy, Varshneya, Saurabh, Syed, Nabeel Hussain, Nagda, Mayank, Liznerski, Phillip, Reithermann, Steffen, Rudolph, Maja, Vollmer, Sebastian, Schulz, Ralf, Katz, Torsten, Mandt, Stephan, Bortz, Michael, Leitte, Heike, Neider, Daniel, Burger, Jakob, Jirasek, Fabian, Hasse, Hans, Fellenz, Sophie, Kloft, Marius
Undetected anomalies in time series can trigger catastrophic failures in safety-critical systems, such as chemical plant explosions or power grid outages. Although many detection methods have been proposed, their performance remains unclear because current metrics capture only narrow aspects of the task and often yield misleading results. We address this issue by introducing verifiable properties that formalize essential requirements for evaluating time-series anomaly detection. These properties enable a theoretical framework that supports principled evaluations and reliable comparisons. Analyzing 37 widely used metrics, we show that most satisfy only a few properties, and none satisfy all, explaining persistent inconsistencies in prior results. To close this gap, we propose LARM, a flexible metric that provably satisfies all properties, and extend it to ALARM, an advanced variant meeting stricter requirements.
- Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- North America > United States > California > Orange County > Irvine (0.04)
- (2 more...)
Probabilistic Machine Learning for Uncertainty-Aware Diagnosis of Industrial Systems
Mohammadi, Arman, Krysander, Mattias, Jung, Daniel, Frisk, Erik
Deep neural networks has been increasingly applied in fault diagnostics, where it uses historical data to capture systems behavior, bypassing the need for high-fidelity physical models. However, despite their competence in prediction tasks, these models often struggle with the evaluation of their confidence. This matter is particularly important in consistency-based diagnosis where decision logic is highly sensitive to false alarms. To address this challenge, this work presents a diagnostic framework that uses ensemble probabilistic machine learning to improve diagnostic characteristics of data driven consistency based diagnosis by quantifying and automating the prediction uncertainty. The proposed method is evaluated across several case studies using both ablation and comparative analyses, showing consistent improvements across a range of diagnostic metrics.
- Europe > Sweden (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- (2 more...)
Dynamic Aware: Adaptive Multi-Mode Out-of-Distribution Detection for Trajectory Prediction in Autonomous Vehicles
Trajectory prediction is central to the safe and seamless operation of autonomous vehicles (AVs). In deployment, however, prediction models inevitably face distribution shifts between training data and real-world conditions, where rare or underrepresented traffic scenarios induce out-of-distribution (OOD) cases. While most prior OOD detection research in AVs has concentrated on computer vision tasks such as object detection and segmentation, trajectory-level OOD detection remains largely underexplored. A recent study formulated this problem as a quickest change detection (QCD) task, providing formal guarantees on the trade-off between detection delay and false alarms [1]. Building on this foundation, we propose a new framework that introduces adaptive mechanisms to achieve robust detection in complex driving environments. Empirical analysis across multiple real-world datasets reveals that prediction errors -- even on in-distribution samples -- exhibit mode-dependent distributions that evolve over time with dataset-specific dynamics. By explicitly modeling these error modes, our method achieves substantial improvements in both detection delay and false alarm rates. Comprehensive experiments on established trajectory prediction benchmarks show that our framework significantly outperforms prior UQ- and vision-based OOD approaches in both accuracy and computational efficiency, offering a practical path toward reliable, driving-aware autonomy.
- Asia > China > Beijing > Beijing (0.04)
- Asia > Singapore (0.04)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- Transportation > Ground > Road (0.94)
- Transportation > Infrastructure & Services (0.68)
Real-time, Adaptive Radiological Anomaly Detection and Isotope Identification Using Non-negative Matrix Factorization
Jones, Chandler, Bandstra, Mark, Faaland, Stefan, Lai, Yue Shi, Abgrall, Nico, Suchyta, Scott, Cooper, Reynold
Spectroscopic anomaly detection and isotope identification algorithms are integral components in nuclear nonproliferation applications such as search operations. The task is especially challenging in the case of mobile detector systems due to the fact that the observed gamma-ray background changes more than for a static detector system, and a pretrained background model can easily find itself out of domain. The result is that algorithms may exceed their intended false alarm rate, or sacrifice detection sensitivity in order to maintain the desired false alarm rate. Non-negative matrix factorization (NMF) has been shown to be a powerful tool for spectral anomaly detection and identification, but, like many similar algorithms that rely on data-driven background models, in its conventional implementation it is unable to update in real time to account for environmental changes that affect the background spectroscopic signature. We have developed a novel NMF-based algorithm that periodically updates its background model to accommodate changing environmental conditions. The Adaptive NMF algorithm involves fewer assumptions about its environment, making it more generalizable than existing NMF-based methods while maintaining or exceeding detection performance on simulated and real-world datasets.
- North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
- North America > United States > New Mexico (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- (9 more...)
- Health & Medicine (1.00)
- Energy (1.00)
- Government > Regional Government > North America Government > United States Government (0.93)
- Government > Military (0.68)
Eufy PoE Bullet Security Camera E40 review: Professional grade
The Eufy PoE Bullet Security Camera E40, along with Eufy's Network Video Recorder S4, is a strong choice for homeowners and small business owners who want the enhanced security and reliability of hardwired cameras; plus, local AI and local storage that eliminates the need for a subscription. Add-on camera, 129.99 (requires Eufy Network Video Recorder S4, 399.99) The Eufy PoE Bullet Security Camera E40 is aimed at homeowners and small business owners who want the reliability of wired infrastructure, along with local storage of security camera recordings to eliminate the cost of a cloud subscription. It's built for people who take their security seriously and are willing to pull cables through their walls to get it. The camera must be paired with Eufy's PoE NVR, which you'll likewise need to hardwire to your home network.
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Communications > Networks (0.50)
Refining Time Series Anomaly Detectors using Large Language Models
Yang, Alan, Chen, Yulin, Lee, Sean, Montes, Venus
Time series anomaly detection (TSAD) is of widespread interest across many industries, including finance, healthcare, and manufacturing. Despite the development of numerous automatic methods for detecting anomalies, human oversight remains necessary to review and act upon detected anomalies, as well as verify their accuracy. We study the use of multimodal large language models (LLMs) to partially automate this process. We find that LLMs can effectively identify false alarms by integrating visual inspection of time series plots with text descriptions of the data-generating process. By leveraging the capabilities of LLMs, we aim to reduce the reliance on human effort required to maintain a TSAD system
- Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.38)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)