Two-Stream Network for Sign Language Recognition and Translation
We adopt identical data augmentations for RGB videos and heatmap sequences to maintain spatial and temporal consistency. SingleStream-SLT, which utilizes only a single video encoder without modelling keypoints, serves as our baseline. TwoStream-SLT-V/K/J denotes the variant in which a single translation network is attached to the video head, keypoint head, or joint head, respectively. The averaged probabilities are used to decode text sequences.
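Decoding from averaged probabilities can be sketched as follows. This is a minimal greedy illustration, not the system's actual decoder (which likely uses beam search); the function name, the toy vocabulary, and the EOS convention are assumptions for the example.

```python
import numpy as np

def greedy_decode_averaged(step_probs_per_head, id2token, eos_id=2):
    """Greedily decode a token sequence from per-step vocabulary
    distributions averaged over several heads (e.g. video, keypoint,
    joint). step_probs_per_head: list of (T, V) probability arrays."""
    # Average across heads: (H, T, V) -> (T, V).
    avg = np.mean(np.stack(step_probs_per_head), axis=0)
    tokens = []
    for dist in avg:
        tok = int(dist.argmax())
        if tok == eos_id:  # stop at end-of-sequence
            break
        tokens.append(id2token[tok])
    return " ".join(tokens)
```

Averaging the distributions before the argmax lets a confident head outvote an uncertain one at each step, which is the usual motivation for this kind of late fusion.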
Supplementary Materials for TVLT: Textless Vision-Language Transformer
                         CMU-MOSEI (A2)
Language Input           HT100M    YTT-S
Audio                    75.3      76.8
Text (ASR-SpeechBrain)   76.5      76.6
Text (ASR-Google)        77.1      77.8
Text (GT Transcripts)    78.9      79.1

Table 2 shows the results of TVLT on CMU-MOSEI sentiment analysis with the following different inputs: audio, ASR-based text, and ground-truth text transcriptions. ASR-Google and ASR-SpeechBrain refer to the Google Cloud API and SpeechBrain, respectively (see main paper Sec.).

Qualitative examples with sentiment annotations:
- "He is under house arrest and his mother takes away his Xboxes and TVs is sort of a little bit of additional punishment." (0.0, -1.0, 0.0, 0.0)
- "And then last year we had 260 something come out to the dance." (1.0, 2.0, 2.0, 1.0)

We use the following configurations: (1) We set a single speech event to have a duration within [0.3s, 1.2s], so that an event is likely to cover a single word. If the silence gap is too large, it is usually a stop between two words. Specifically, we construct a 4-layer transformer language model that attends to TVLT encoder outputs via cross-attention and jointly train the encoder and decoder.
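The speech-event segmentation in configuration (1) can be sketched as grouping frame-level voice-activity flags into events, merging runs separated by short silences and enforcing the [0.3s, 1.2s] duration window. The function name, the frame length, and the `max_gap` silence threshold are assumptions; only the duration bounds come from the text.

```python
def segment_speech_events(voiced, frame_sec=0.02,
                          min_dur=0.3, max_dur=1.2, max_gap=0.15):
    """Group per-frame voiced flags into speech events, returned as
    (start_sec, end_sec) pairs."""
    # Collect contiguous voiced runs as (start, end) frame indices.
    runs, start = [], None
    for i, v in enumerate(voiced):
        if v and start is None:
            start = i
        elif not v and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(voiced)))
    # Merge runs separated by a short silence; a large silence gap is
    # treated as a stop between two words and keeps the events apart.
    merged = []
    for s, e in runs:
        if merged and (s - merged[-1][1]) * frame_sec <= max_gap:
            merged[-1] = (merged[-1][0], e)
        else:
            merged.append((s, e))
    # Enforce the duration window: split overly long events into
    # max_dur chunks, drop remnants shorter than min_dur.
    events = []
    for s, e in merged:
        while (e - s) * frame_sec > max_dur:
            cut = s + int(max_dur / frame_sec)
            events.append((s * frame_sec, cut * frame_sec))
            s = cut
        if (e - s) * frame_sec >= min_dur:
            events.append((s * frame_sec, e * frame_sec))
    return events
```

With these bounds an event tends to cover a single word, which matches the stated intent of the configuration.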
PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds
Pre-training is crucial in 3D-related fields such as autonomous driving, where point cloud annotation is costly and challenging. Many recent studies on point cloud pre-training, however, have overlooked the issue of incompleteness, where only a fraction of the points are captured by LiDAR, leading to ambiguity during the training phase. On the other hand, images offer more comprehensive information and richer semantics that can bolster point cloud encoders in addressing the incompleteness issue inherent in point clouds. Yet, incorporating images into point cloud pre-training presents its own challenges due to occlusions, potentially causing misalignments between points and pixels. In this work, we propose PRED, a novel image-assisted pre-training framework for outdoor point clouds in an occlusion-aware manner. The main ingredient of our framework is a Bird's-Eye-View (BEV) feature map conditioned semantic rendering, leveraging the semantics of images for supervision through neural rendering. We further enhance our model's performance by incorporating point-wise masking with a high mask ratio (95%). Extensive experiments demonstrate PRED's superiority over prior point cloud pre-training methods, providing significant improvements on various large-scale datasets for 3D perception tasks. Code will be available at https://github.com/PRED4pc/PRED.
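Point-wise masking at a 95% ratio amounts to feeding the encoder only a small random subset of points during pre-training. A minimal sketch under that reading; `mask_points` is a hypothetical helper, not the paper's implementation.

```python
import numpy as np

def mask_points(points, mask_ratio=0.95, rng=None):
    """Randomly keep (1 - mask_ratio) of the points.
    points: (N, C) array of LiDAR points (xyz + features).
    Returns the visible points and their original indices."""
    if rng is None:
        rng = np.random.default_rng(0)
    n = points.shape[0]
    keep = max(1, int(round(n * (1.0 - mask_ratio))))
    idx = rng.permutation(n)[:keep]
    return points[idx], idx
```

A high mask ratio makes the reconstruction/rendering objective harder and forces the encoder to infer scene structure from sparse evidence, the usual rationale for masked pre-training.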
SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision
Hamidullah, Yasser, Yazdani, Shakib, Oguz, Cennet, van Genabith, Josef, España-Bonet, Cristina
Sign language translation (SLT) is typically trained with text in a single spoken language, which limits scalability and cross-language generalization. Earlier approaches have replaced gloss supervision with text-based sentence embeddings, but up to now, these remain tied to a specific language and modality. In contrast, here we employ language-agnostic, multimodal embeddings trained on text and speech from multiple languages to supervise SLT, enabling direct multilingual translation. To address data scarcity, we propose a coupled augmentation method that combines multilingual target augmentations (i.e. translations into many languages) with video-level perturbations, improving model robustness. Experiments show consistent BLEURT gains over text-only sentence embedding supervision, with larger improvements in low-resource settings. Our results demonstrate that language-agnostic embedding supervision, combined with coupled augmentation, provides a scalable and semantically robust alternative to traditional SLT training.
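At its core, language-agnostic embedding supervision matches a predicted sign-video embedding against a target multilingual sentence embedding (e.g. from SONAR). A minimal sketch using cosine distance; the function name is hypothetical and the choice of cosine distance is an assumption, as the exact loss is not specified here.

```python
import numpy as np

def embedding_supervision_loss(pred, target):
    """Cosine distance between a predicted video embedding and a
    language-agnostic target sentence embedding. 0 when the vectors
    point the same way, up to 2 when they are opposed."""
    pred = pred / np.linalg.norm(pred)
    target = target / np.linalg.norm(target)
    return 1.0 - float(pred @ target)
```

Because the target space is shared across languages and modalities, the same loss can supervise translation into many languages at once, which is what enables the multilingual setup described above.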