AITopics | fmap

Collaborating Authors

fmap

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

whichimpliesthat: Pr(ˆq q 1 d(1/ n+ϵ)) e nϵ

Neural Information Processing SystemsFeb-9-2026, 19:30:06 GMT

To extend this and adapt other results to our setting, we could now apply the Simulation Lemma [1]to bound the value difference given the model error,or alternatively, develop the theory in the direction of[55]andrelated work. Code is available at https://github.com/spitis/mocoda Forexample, in2d Navigation,themaskfunction was implementedasfollows: def Mask2dNavigation(input_tensor): """ accepts B x num_sa_features, and returns B x num_parents x num_children """ # base local mask mask = torch.tensor( Theadvantageofthisapproach isthat we can easily do conditional sampling incase of overlapping parent sets. The CQL implementation uses SAC [17].

artificial intelligence, machine learning, tensor, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.95)

Add feedback

Generating ensembles of spatially-coherent in-situ forecasts using flow matching

Landry, David, Monteleoni, Claire, Charantonis, Anastase

arXiv.org Artificial IntelligenceApr-28-2025

We propose a machine-learning-based methodology for in-situ weather forecast postprocessing that is both spatially coherent and multivariate. Compared to previous work, our Flow MAtching Postprocessing (FMAP) better represents the correlation structures of the observations distribution, while also improving marginal performance at the stations. FMAP generates forecasts that are not bound to what is already modeled by the underlying gridded prediction and can infer new correlation structures from data. The resulting model can generate an arbitrary number of forecasts from a limited number of numerical simulations, allowing for low-cost forecasting systems. A single training is sufficient to perform postprocessing at multiple lead times, in contrast with other methods which use multiple trained networks at generation time. This work details our methodology, including a spatial attention transformer backbone trained within a flow matching generative modeling framework. FMAP shows promising performance in experiments on the EUPPBench dataset, forecasting surface temperature and wind gust values at station locations in western Europe up to five-day lead times.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2504.03463

Country:

North America > United States (0.28)
Europe > Western Europe (0.24)

Genre:

Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)

Industry: Energy (0.67)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Add feedback

On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data

Martinez-Seras, Aitor, Del Ser, Javier, Andres, Alain, Garcia-Bringas, Pablo

arXiv.org Artificial IntelligenceNov-7-2024

Robustness is a fundamental aspect for developing safe and trustworthy models, particularly when they are deployed in the open world. In this work we analyze the inherent capability of one-stage object detectors to robustly operate in the presence of out-of-distribution (OoD) data. Specifically, we propose a novel detection algorithm for detecting unknown objects in image data, which leverages the features extracted by the model from each sample. Differently from other recent approaches in the literature, our proposal does not require retraining the object detector, thereby allowing for the use of pretrained models. Our proposed OoD detector exploits the application of supervised dimensionality reduction techniques to mitigate the effects of the curse of dimensionality on the features extracted by the model. Furthermore, it utilizes high-resolution feature maps to identify potential unknown objects in an unsupervised fashion. Our experiments analyze the Pareto trade-off between the performance detecting known and unknown objects resulting from different algorithmic configurations and inference confidence thresholds. We also compare the performance of our proposed algorithm to that of logits-based post-hoc OoD methods, as well as possible fusion strategies. Finally, we discuss on the competitiveness of all tested methods against state-of-the-art OoD approaches for object detection models over the recently published Unknown Object Detection benchmark. The obtained results verify that the performance of avant-garde post-hoc OoD detectors can be further improved when combined with our proposed algorithm.

configuration, detection, detector, (16 more...)

arXiv.org Artificial Intelligence

2411.04586

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Spain > Basque Country > Biscay Province > Bilbao (0.04)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
Europe > Monaco (0.04)

Genre: Research Report > New Finding (0.93)

Industry: Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

A Multi-Level Superoptimizer for Tensor Programs

Wu, Mengdi, Cheng, Xinhao, Padon, Oded, Jia, Zhihao

arXiv.org Artificial IntelligenceMay-9-2024

For We introduce Mirage, the first multi-level superoptimizer for a given algorithm, these optimizers automatically generate tensor programs. A key idea in Mirage is Graphs, a uniform performant kernels by searching over possible strategies for representation of tensor programs at the kernel, thread block, executing the kernel on the target hardware. However, due and thread levels of the GPU compute hierarchy. Graphs to the linear algebra nature of DNNs, a tensor program can enable Mirage to discover novel optimizations that combine be represented by a wide spectrum of mathematically equivalent algebraic transformations, schedule transformations, and algorithms, and existing schedule-based optimizers only generation of new custom kernels. To navigate the large consider kernels whose algorithms are manually specified search space, Mirage introduces a pruning technique based by users, resulting in missed optimization opportunities.

graph, mirage, operator, (15 more...)

arXiv.org Artificial Intelligence

2405.05751

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

HarDNN: Feature Map Vulnerability Evaluation in CNNs

Mahmoud, Abdulrahman, Hari, Siva Kumar Sastry, Fletcher, Christopher W., Adve, Sarita V., Sakr, Charbel, Shanbhag, Naresh, Molchanov, Pavlo, Sullivan, Michael B., Tsai, Timothy, Keckler, Stephen W.

arXiv.org Machine LearningFeb-25-2020

As Convolutional Neural Networks (CNNs) are increasingly being employed in safety-critical applications, it is important that they behave reliably in the face of hardware errors. Transient hardware errors may percolate undesirable state during execution, resulting in software-manifested errors which can adversely affect high-level decision making. This paper presents HarDNN, a software-directed approach to identify vulnerable computations during a CNN inference and selectively protect them based on their propensity towards corrupting the inference output in the presence of a hardware error. We show that HarDNN can accurately estimate relative vulnerability of a feature map (fmap) in CNNs using a statistical error injection campaign, and explore heuristics for fast vulnerability assessment. Based on these results, we analyze the tradeoff between error coverage and computational overhead that the system designers can use to employ selective protection. Results show that the improvement in resilience for the added computation is superlinear with HarDNN. For example, HarDNN improves SqueezeNet's resilience by 10x with just 30% additional computations.

fmap, relative vulnerability, vulnerability, (13 more...)

arXiv.org Machine Learning

2002.09786

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback